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ABSTRACT 

Frequency analysis of hydrologic dat.a is a knowledge domain 
where lot of decisions are to be taken based on human 
intuition, experience, and subjective judgement. Being a data 
dependent technique the reliability of frequency analysis is 
always questionable. To fit a theoretical distribution function 
to a set of observations and to draw conclusions therefrom, 
require a combination of routine numerical/statistical 

computation and human expertise in the form of heuristics. In 
order to address these problems, a program. Frequency Analysis of 
Continuous Hydrologic Variables with an embedded Expert System 
(FACHVES) has been developed. This program was developed in 
FORTRAN for microVAX-VMS environment. It consists of methods for 
computerised statistical analysis supported by an expert decison 
support system. The framework consists of four parts'- main 
program, subroutines, expert system interface, and a knowledge 
base. The developed work completed so far is preliminary in 
nature and it provides the framework for future development of a 
more comprehensive computer program package . 

The developed package for FACHVES was tested with t-wo 
different data sets representing recorded streamflow data at two 
different sites in India. The results of this testing shows the 
proper functioning of the package at this stage of development. 
These results also established the feasibility of this approach 
for statistical analysis of the hydrologic data by a combination 
of computational procedures and expert advice. 



CHAPTER 1 


INTRODDCTION 


1.1. General 

With the development of digital computers, engineering 
design and analysis got new dimensions. Fast and efficient 
computing due to computers made several complex analysis 
techniques, e.g., numerical analysis stati.stical analysi.s etc., 
computationally feasible. Higher level languages like FORTRAN, 
BASIC, C and PASCAL were developed and a number of programs 
useful for engineering problem solving have been written in these 
languages. Though initially these programs were useful to solve 
a problem, they were not able to solve complex problems involving 
a number of ordinary and partial differential equations, logical 
constraints, or analyse problems involving uncertainties. 
Therefore for the processes which are not well understood, 
involving uncertainties due to lack of data in quality and 
quantity, simulation analysis had to be resorted to. Later on it 
was realised that in several cases a number of complex problems 
depended upon intuition and subjective judgment of the analyst 
and could not be solved by routine analytic techniques. 
Development of languages like LISP and PROLOG in which logical 
approaches were easier to be programmed, facilitated the design 
of a system to sort out these problems. Initially the intention 
behind these approaches was to design such a computing system 
which can think and derive conclusions from logical and 



numerical analysis Just as a human being. After a number of 
attempts the researchers in this field realised the impossibility 
of such a universally wise system. The recent trend of research 
in this field is to design limited systems for specific domains. 

Water resources engineering (WRE) deals with many processes 
whose physics is not well understood and many a time decisions 
are to be taken on the basis of past experience, intuition- etc. 
Moreover data available for analysis are also subject to many 
observational or computational errors. Therefore problem solving 
techniques in this field of engineering design also need such a 
system which can incorporate the domain specific expertise with 
computational techniques. 

Thus computer aided analysis and design in engineering seems 
to be presently (1990) undergoing a dramatic change in that the 
numerical, analytic and logical capabilities of a digital 
computer are extended with the heuristic capabilities of 
artificial intelligence and expert systems in efficiently solving 
problems in the areas of analysis and design. 

1.2 Water Resources and Expert System 

1.2.1 General 

Intelligence in human nature is associated with one or more 
of the following abilities: 

to respond to situations very flexibly 

* . to make sense out of ambiguous or contradictory messages 
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*. to recognise the relative importance of different 
elements of a situation 

*. to find similarities between situations despite 
differences which may separate them; and 
* to draw distinction between situations despite 
similarities which may link them. 

Above abilities are nothing but common sense of people and 
they come naturally to them by experience, knowledge of 
heuristics and largely due to mental faculties. Artificial 
Intelligence (AI) is simply a way to impart these abilities, 
i.e., intelligence to a computer (Levine, 1988). This can be 
accomplished by studying how people think when they are trying to 
make decisions and solve problems; breaking those thought 
processes down into basic steps; and then designing a computer 
program that solves problems using those same steps. Fig. 1.1 
schematically shows these steps. Thus AI can be defined as that 
part of computer science concerned with designing intelligent 
computer systems, i.e., systems that exhibit the intelligent 
characteristics of human beings like understanding language, 
learning, reasoning, solving problems and so on. 

The field AI encompasses robotics, game playing, the 
automated translation of language and a major subdiscipline 
called Expert System (ES). ES is concerned with the development 
of computer software that can partially represent human knowledge 
and utilise that knowledge to solve complex problems within a 
specific domain (Johnston, 1985). 



NEW KNOWLEDGE GAINED 
FROM TESTING MODELS 



COGNITIVE AI RESEACHERS 

SCIENTISTS 


FIG. 1.1 : "The Feedback Loop" in AI Computer Modelling 

(Mishcoff, 1986) 







1.2.2 Need of ES in waier resources engineering 


Water resources engineering deals with planned development 
and management of water, the spatially and stochastically varying 
natural resource above and below ground. Optimal utilisation of 
water requires the evaluation of the occurrence, distribution and 
variability in time and space of the water resources as well as 
their extremes like flood and droughts. In all the four stages 
of designing a water resource system, viz. planning, development, 
management and design, a large amount of uncertainties are 
involved. These uncertainties can arise due to hydrology, 
hierarchical and multistage nature of decision making, financial 
and economic variabilities, and implementation techniques, social 
constraints, and changing national conditions. An efficient 
approach to the design of a water resource system should be able 
to take care of these uncertainties as far as it can. A large 
amount of domain dependent expertise is needed in planning, 
design, construction and integrated operation of water resource 
systems and they are often heuristic in nature. Therefore, water 
resources engineering has a potential and justification for ES 
applications . 

1.2.3 ES applications in water resources engineering 

For many areas of water resources ES are being developed and 
applied. They include areas like database management, 
information systems, and water quality management (Smoilier, 
1986; Datta and Peralta, 1986; Arnold et al, 1989; Simonovic, 
1989; Datta etal, 1990; ); selection of design data as design 



storm (Nielson, 1986) or flood estimation (Fayegh and Russel, 
1986) and appropriate treatment technology for water 
supply/sewage (Arnold, 1986). ES has been applied also for data 
analysis (Wilson, 1986), hydrologic modeling and parameter 
estimation (Engman, et al, 1986; Delleur, 1988), tank irrigation 
system (Oswald, 1989), choice of model to be used and preparation 
of input data for a reservoir system (Savic and Simonovic, 1989) 
and for an urban storm sewer system (Lindberg and Nielson, 1986), 
Also, management of multipurpose system of reservoirs integrated 
with conjunctive use of surface water and ground water are 
complex problems which need human expertise, common sense and 
heuristics. ES like SERPES for sewage rehabilitations planning 
process and WADNES for handling emergencies in a water supply 
network (Ahmad et al , 1989) have been successfully implemented. 

Datta and Peralta (1986) proposed a methodology for 
embedding a statistical pattern recognition system within an 
expert decision support system for identifying unknown sources of 
ground water pollution. This method was developed and tested for 
application (Datta et al, 1990). The concept of combining the 
decision making capabilities of an expert decision support system 
and a geographical information system (intelligence GIS) was 
proposed in Arnold et al, (1989). The primary purpose of such 
a system is to incorporate heuristic knowledge regarding 
measurement and other uncertainties in the estimation of 
hydrologic or water quality variables. 

These applications clearly show the possibilities of ES 
application to water resource engineering. 



1 . 3 Objective of the Study 


Water resources engineering encompasses a vast domain. 
Since it is not possible to consider in this study each and every 
aspect, a particular area, viz., only probabilistic approach has 
been considered and frequency analysis of hydrologic data is 
chosen. Frequency analysis of hydrologic data is a necessary 
first step in design of hydraulic structures, control of extreme 
hydrologic events management of water resources etc. This field 
of water resource engineering was found interesting and suitable 
for ES applications as it deals with data lacking in quality and 
quantity, having observational and computational errors and 
sparseness. Moreover, there is no universally acceptable 
methodology for fitting a probability distribution to a given set 
of data and if at all, only a limited knowledge about the parent 
distribution of samples may be available. Physical processes 
resulting in a high or low precipitation or streamflow are also 
not well understood. Often the presence of outliers, measurement 
errors or occurrence of rare combination of physical processes or 
multiple distributions makes the problem more complicated. Hence 
an efficient approach to frequency analysis requires 
incoirporation of statistical inference tools as well as heuristic 
knowledge of knowledgeable human experts in this field. Moreover, 
ES application to frequency analysis seems to be a virgin area of 
study. The major objectives of the study are as follows: 

1. To develop a framework for an ES for frequency analysis 
of hydrologic data. 



2. To implement an Interactive FORTRAN based program with 
an ES in I IT Kanpur computing environment; and 

3. To develop and test an interactive program for frequency 
analysis of hydrologic data. 

1.4 Scope of the Study 

A variety of computer program© in Fortran language are 
available in IIT Kanpur for fitting one or more probability 
distribution. Furthermore an HP-9000 computer system with a 
Unix operating system, a MicroVAX II computer system with a VMS- 
VAX operating system and IBM PCs, are available for the study. A 
number of ES shells of different capabilities are also available. 
However the scope of study was limited because of time constraint 
and has been limited to: 

1. Fortran based program with an ES tool (CLIPS, as identified in 
Sec . 2.4.1) 

2. Frequency analysis of continuous hydrologic variables and 
testing only a few sets of data 

3. A limited number of methods and approaches to frequency 
analysis, and 

4. MicroVAX- VMS environment. 

1.5 Organisation of the Study 

The study is reported in the following sequence: 

1. Introduction to expert system; need and application in water 

resources engineering; objectives, scope, and organisation of thel 
study (Chapter 1). 
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2. A brief int-roducbion to expert systems, building expert 
systems, different expert system tools, and selection of ES 
tool (CLIPS) for the study (Chapter 2). 

3. Introduction to CLIPS and some of its salient features 
( Chapter 3 ) . 

4. Problems involved in frequency analysis in hydrology; 
expertise available in frequency analysis; introduction and 
flow charts of the FACHVES (Frequency Analysis for 
Continuous Hydrologic Variables with Embedded Expert System) 
program, application, and discussion of results (Chapter 4) 
and 

6. Summary, conclusion and suggestions for future study 
( Chapter 6 ) . 



CHAPTER II 


EXPERT SYSTEMS 


2 . 1 Introduction 

Human experts in any field are frequently in great demand 
and are also generally in short supply. AI presents a solution 
to this problem through an expeirt system (ES) which is a 
computing system capable of representing and reasoning about some 
knowledge-rich domain with a view to solving problems and giving 
advice (Jackson, 1986). It is also known as knowledge - based 
expert system (KBES). 

Gasching (1981) defines KBES as "interactive computer 
program incorporating judgment, experience, rules of thumb, 
intuition and other expertise to provide knowledgeable advice 
about a variety of tasks". All the knowledge in ES is provided 
by people who are expert^ in that domain and it has got expert 
information essentially for the given domain. So ES’s act as 
intelligent assistants to human expert, and also provide 
assistance to people who otherwise might not have access to 
expert advice. 

2.2 Difference Between KBES and Other Programs 

Although both KBES and database programs feature the 
retrieval of stored information, the two types of programs differ 
greatly . 
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A database program retrieves facts that are stored, while at 
KBES uses reasoning to draw conclusions from stored facts (Fig. 
2.1). Interactive programs that incorporate graphics and some 
sort of expertise in the form of constraints and limitations, 
assumptions and approximations can be considered to give results 
as advices rather than answers. Yet KBES differs from 
traditional computer programs in that (Adely, 1985); 

1. Expert systems are knowledge intensive programs; 

2. In a rule based expert system, expert knowledge is usually 
divided into many separate rules; 

3. The rules forming a knowledge base or expert knowledge is 
separated from the methods for applying the knowledge to the 
current problem. These methods are referred to as inference 
mechanism or rule interpreter; 

4. Expert systems are highly interactive; 

5. Expert systems have user friendly intelligent user interface; 
and 

6. Expert systems, to some extent, mimic the decision making and 
reasoning process of human experts. They can provide advice, 
answer questions and justify their conclusions. 

Nonintelligent programs generally follow a well defined 
algorithm that specifies explicitly how to find the output 
variables for any given input variables. It is called procedural 
programming. On the other hand, in the intelligent program used 
in KBES the behavior of the program is not explicitly described 
by the algorithm. The sequence of steps followed by the program 
is influenced by the particular problem presented to it. This is 




FIG. 2.1 DATABASE PROGRAM VS. ES 
(Harmon and King, 1985) 
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called declarative prograunmlng. Declarative programming is more 
efficient than procedural progranuning in that it enables computer 
to absorb new information at intermediate steps and further 
action can be referred to without unnecessarily going all the way 
back to the beginning of the program or disturbing the already 
established other facts (Levine, 1988). 

2 . 3 Building Expert Systems 

2.3.1 Architecture of KBES 

A KBES generally has four principal components (Fig. 2.2). 
They are, a knowledge base, working memory, and inference engine 
and a user interface. As KBES vary in design, they may have 
other components also, e.g., graphics ^system analysis and other 
software. 

a. Knowledge base. A knowledge base contains both declarative 
knowledge (facts about objects, events and situations) and 
procedural knowledge (information about courses of action) which 
may be scientific, analytic or heuristic rules (Fig. 2.3). 
Although many knowledge representation techniques have been used 
in ES, the most prevalent form of knowledge representation 
currently used in ES is the rule based production system 
approach. The rules have generally two parts, conditions and 
actions. The rules are fired when the conditions are matched 
with the facts. The actions can be for processing instructions 
or control instructions. The rules may include metarules which 
are rules about rules . 




FIG. 2.2 


ARCHITECTURE OF A TYPICAL ES 
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DOMAIN OF EXPERT SYSTEM 
KNOWLEDGE BASE 


FACTS 


HEURISTICS 

ABOUT 


FOR 

DOMAIN 


DOMAIN 


FIG. 2.3 : THE COMPONENTS OF THE KNOWLEDGE BASE OF AN 
ES (Schank and Childers, 1984) 







16 


b. Working memory. It is the current active set of the knowledge 
base and may include a knowledge management module. 

c. Inference engine. It is the most crucial component of ES 
since it matches and manipulates the database for problem 
solving. It is the inference mechanism which also provides 
justification for advice from ES. Three formal approaches used 
in this case are production rules, structured objects and 
predicate logic. Production rule consists of a rule set, a rule 
interpreter which specifies when and how to apply the rules and 
working memory that holds data, goals or intermediate results. 
Structured objects use vector representation of essential and 
accidental properties. Predicate logic uses prepositional and 
predicate calculus. 

d. User interface. It is the communication module which provides 
bi-directional exchange of information between user and system. 

2.3.2 ES Techniques 

The order of execution of the rules and/or procedures in an 
ES is governed by the inference engine in terms of the problem 
solving strategy used. Maher (1986) considers two approaches: 

a. The derivation approach: It involves deriving a solution that 
is most appropriate for the problem from a list of predefined 
solutions stored in the knowledge base of ES. It includes 
forward chaining (or goal driven control strategy); backward 
chaining (or data driven control strategy) and a hybrid strategy 
combining both these strategies . Forward chaining works from an 



initial state of known facts to the goal state and backward 
chaining works from a hypothetical goal state to the facts 
perhaps in terms of subgoals. The subgoals are preconditions for 
the goal stated. If the hypothesis is not supported by facts, it 
tests for another goal state and so on in a predefined order of 
goals . 

b. The formation approach: It involves forming a solution from 
eligible solution components stored in the knowledge base. It 
includes problem reduction (into subprograms), plan - generate 
test (which generates all possible solutions, prunes inconsistent 
solutions and tests the remaining solutions), and agenda control. 
In agenda control a priority rating to each task in the agenda is 
assigned and the tasks are performed according to the assigned 
priority. 

These techniques may be combined with other techniques for 
hierarchical planning, least commitment backtracking and 
constraint handling (Maher, 1986). Some other techniques 
available Include inductive inference, metareasoning, ill 
structured problem and data handling etc. 

2.3.3 Developing an ES 

Developing an ES is a time consuming team work. 
Particularly, for developing a sophisticated ES an intensive and 
coherent effort is required. Knowledge engineers emd domain 
experts work together to design an ES. The knowledge engineer 
develops the expert system, and the domain expert provides the 
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inf orma-tion for the knowledgebase. Haye§ - Roth et al (1983) 
have identified five sequential stages in the development of ES, 
shown schematically in Fig. 2.4. Each stage is iterative in 
nature. Some of the stages are shown in Fig. 2.5. 

2.4. ES Tools 

A wide variety of development tools and environments are 
available for ES. These tools can be one of the general purpose 
programming languages or an ES shell. An ES shell is a set of ES 
development programs containing no knowledge about a problem, but 
can be 'taught’ in a particular field or other. They contain all 
the modules required for ES. Filling of their hollow 
knolwdgebase makes them knowledge based expert system. Broadly, 
these ES tools are catagoriesed as: 

1. Programming languages like PASCAL and C; 

2. AI based exploratory programming languages like LISP and 
PROLOG; 

3. ES shells like VIDHI which may be based on item "2"; 

4. High level ES programming environments like 0PS5, ART, KES, 
NEXPERT, PC PLUS, RULE MASTER, CLIPS etc.; and 

5. Mixed programming environments which allow the programmer to 
mix programming languages as in item "1" and item "2" with 
high level ES programming environment as in item ”4*'. 
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IDENTIFICATION 


DETERMINING THE CHARACTERISTICS 
OF THE PROBLEM 


CONCEPTUALISATION 


FINDING CONCEPTS TO REPRESENT 
THE KNOWLEDGE 


FORMALISATION 


DESIGNING STRUCTURES TO 
ORGANISE THE KNOWLEDGE 


IMPLEMENTATION 


FORMULATING RULES THAT 
EMBODY THE KNOWLEDGE 


TESTING 
VALIDATING THE RULES 


B 


FIG. 2.4 : FIVE STAGES OF ES DEVELOPMENT 

(Adapted from Hayes - Roth et al, 1983) 






FIG. 2.5 : ITERATIVE STAGES OF ES DEVELOPMENT 









2.5 Selection of ES Tool for Study 


ES tools and shells available for the study indue 
prograniTning languages like FORTRAN, C, PASCAL etc. , exploratoi 
programming languages LISP and PROLOG and an ES shell as VIDHI s 
well as ES tools and environments including INSIGHT, PC PLUS 
RULE MASTER, and CLIPS. However, the choice of the tools depend 
upon the hardware and software available in IIT Kanpur and th 
topic of the study. 

Several computer programs and subroutines developed eithe 
in IIT Kanpur or outside for frequency analysis of hydrologi 
data are available for the study. They include a compute 
program FAP for frequency analysis of seasonal hydrologic dat 
using different distributions and methods of paramete 
estimation: a computer program for fitting normal distribution b 
methods of least square; programs for transforming data with non 
normal distribution to those of normal or near norma 
distribution; a number of computer programs for fitting differen 
distributions to hydrologic data (Kite, 1977); and a number o 
computer programs developed by NIH Roorkee, HEC programs of Ui 
Army Corps of Engineers, Hydrologic Engineering center of US Arm; 
Corps of Engineers etc. Furthermore, the hardware and softwari 
facilities available in IIT Kanpur computing environment includi 
HP 9000-800 system with UNIX operating system; microVAX If 
computer system with VMS-VAX operating system and IBM PC systems: 
Programming languages FORTRAN, PASCAL and C are available in all 
the three systems. Because of extensive library of program: 
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available in FORTRAN language for the study, it was proposed to 
use expert system in a mixed environment so that these FORTRAN 
programs can be accessed as necessary. 

A brief comparison of the capabilities of the ES tools 
available for the study shows that CLIPS has a variety of 
features very much suitable for IIT Kanpur computing environment 
and interaction with FORTRAN programs: 

1. It is written in C language for which compilers are available. 

2. It is possible to call functions from CLIPS for numerical 
analysis, database, graphics etc. 

3. It is possible to embed a CLIPS program within programs 
written in C, FORTRAN etc. and vice versa. 

4. There is no limitation on length of embedded program other 
than the memory of the computer. 

5. It can be ported to all the three computer systems available 
for the study. 

Hence CLIPS is chosen as the ES tool in a mixed programming 
environment with FORTRAN for developing an ES for frequency 
analysis, specially in a VAX VMS environment. 



CHAPTER III 


CLIPS : AN ES SHELL 


3. 1 Introduction 

CLIPS ( 'C' Language Integrated Production Systein) is a rule 
based forward chaining mixed programming expert system shell 
developed by National Aeronautics and Space Administration 
(NASA), Houston, Texas. The source programs of CLIPS (Version 
4.3) are written in 'C’ language. The primary representation 
methodology is a forward chaining rule language based on Rate 
algorithm (Forge, 1982) for multiple pattern matching and 
conflict resolution. 

CLIPS can be used to develop a KBES in two ways, either as an 

1. Interactive executable element; or 

2. Embedded executable element. 

In both ways the primary methodology remains the same with a 
difference in command syntax and in string conversion (if 
embedded with a language other than 'C’). CLIPS provides a 
programming environment in which a knowledge base* can be created 
and facts can be added, removed or generated. The inference 
engine attempts to arrive at a conclusion based on the defined 
set of domain specific knowledge base and current state of facts. 

[* It may be noted that bold letters used in this Chapter other 
than the titles indicate reserved terms in CLIPS. ] 
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3.2 Basic Features 

3.2.1 Knowledge representation 

a. Facts: A fact represents a piece of information and is placed 
in a current list of facts, the fact-list. It acts as a pattern 
for matching the conditions of a rule in order to fire that 
particular rule. Any amount of facts can be stored in the fact- 
list limited only by the memory of the computer. Facts can be 
added (asserted) or removed (retracted) in two ways, viz., prior 
to execution and by the action of a rule firing. A fact can be 
a number, a word or a string. 

A number is any field which consists only of numbers (0-9), 
a decimal point (.), a sign ( + or -), and optionally an (e) for 
exponential notation with its corresponding sign. The number of 
significant digits and round of error may also occur depending 
upon machine implementation. 

A word in CLIPS is any set of characters that starts with 
an alphabetic character and is followed by zero or more letters 
(A - Z), number (0 - 9), under scores (_) or dashes (-). The 
word ends with a space. Space or other special characters may 
not be included within a word and furthermore CLIPS is case 
sensitive. 

A string is a set of characters that starts with a double 
quotes (") and is followed by one or more letter (A - Z), numbers 
(0 - 9), underscores (_), dashes (-), spaces, or special 
character (any printable character). A string ends with double 



„ v.ie Quotes or other special characters may be 
quotes. Double ** 

, ,, jxv-in a field by pressing a backslash (\) in front of 

embedded wltniu 

the character - 

, j! ^+‘1 are of of free form (no reserved words or 

CLIPS faCT'i’ 7 

, .first fact in the fact list is called initial-fact 

order). The 

, ^+e>.d by the system during a reset. 

and it is assertea 

Defining facts : With the def facts construct, facts can 

. , , , ^ +v,p. initial fact list. The asserted facts can be 

be added to 

^ , other facts. The initial fact-list, including any 

treated as any 

, is always reconstructed after a reset, 

defined def facts* 




(def facts 


<naine> C " < comment > *’ ] 


(<<fact 1>>> 


(«fact 


n»)]) 


is a word used to identify the set of facts. 

where <nain®'^ 

Comment is opti*^^®^ ’ 

Amary method of representing knowledge in CLIPS is 

b. Rule: The 

is a collection of actions to be taken if the 

a rule. A rul® 

^et.The conditions are patterns which act as 
conditions are ™ 

j ais*^ provide a way to describe how to solve a 
constraints an^ 


problem. 
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Defining rule: Rules are defined using defrule construct. Each 
rule in CLIPS must have at least one condition and one action. 
There is no limit other than the actual memory of computer to the 
number of conditions or actions, a rule may have. 

Syntax 1 

( defrule <name> ["<comment>"] 

[ . 

; Left-Hand Side 
; (LHS) 

(<<nth pattem>>)3 
=> 

(<<first action>>) 

C . 

: Right-Hand Side 
(RHS) 

( <<mth action» ) 3 ) 

where <name> is the name of the rule, a word and "<comment>" 
is optional. 

LHS of the rule is made up of a series of one or more 
patterns which represent the condition elements for the rule. 
There is always an implicit (automatically implied) and 
surrounding all the patterns on the LHS. However or, not, 
explicit and (provided explicitly) or test (for mathematical or 
logical constraints), or a combination of them can also be 
incorporated. A rule priority can be assigned to a particular 
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rule by providing, in addition to other patterns, a declare 
pattern which declares its salience value ranging from a highest 
value of 10000 to a lowest value of -10000 . Single field 

pattern or multiple field patterns can be generated by proper 
syntax . 

In RHS a list of the actions to be performed when LHS of 

the rule is satisfied is given. The arrow (=>) separates RHS 

from LHS. An action in CLIPS can be of many types. It can 

create new facts by assert action, can call an external function, 

can retract a fact, assert a string and bind a variable to a 

number or word (assign a value to the variable). Opening or 

closing a file, stopping CLIPS by halt, formatting by format and 

mathematical or logical predicate functions are some of the more 

general actions that can be incorporated. Two important features 

of the RHS actions if-then else structure and while structure are 

defined as follows: 

Sz.iLtax..i 

I. 

(if (<predicate function> [<< args ....>>3 
then 

(<<action 1») 

(<<action n>>) 
felse 

(<<action 1») 


(<<action m»)3) 
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II. 

(while <predica'be fujaction> t<<arffs. . . >>] ) 

(< Oct ion 1>>) 

(<<actioiii n»)) 

All predicate functions available in CLIPS are given in 
CLIPS Manual Part I (Culbert, 1987). 

3.2.2 Inference engine 

The inference engine of CLIPS is rule based and as indicated 
earlier (Sec. 2.3.1 and Sec. 3.1) works on a forward chaining 
inference roechanisni. It is based on Rete algorithm (Forge, 1984) 
for multiple pattern matching and conflict resolution. 

3.2.3 Basic cycle of execution 

Unlike a conventional language the starting and stopping 
points are not explicitly defined by the programs in CLIPS. The 
inference engine applies the knowledge (rules) to data (facts). 
The basic cycle of execution is as follows: 

1. The knowledge base is examined to see if the conditions of any 
rule has been met. 

2. All rules whose conditions are currently met are activated and 
placed on the agenda (a stack). Rules having higher priority 
are kept on the top of the stack and activated before the new 
rule. Rules having lower or equal priority remain below the 
new rule. 

3. The top rule on the agenda is selected and its RHS actions 


executed. 
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As a result of RHS actions new rules can be activated or/and 
deactivated . 

This cycle is repeated until all rules that can fire have 
done so, or until a so called rule limit is reached 
(Appendix. A. ) . 

3 . 3 Embedded Application of CLIPS 

CLIPS has an added advantage of being integrated with, 
external functions, and/or ‘C’ , FORTRAN or Ada language. This 
capability makes its application very flexible and more suited to 
engineering applications where numerical computations are of 
great importance, and where computer programs in any of the 
languages are already available. 

3.3.1 External function 

An external function defined by the user for his specific 
use in problem solving can be either in ‘C’ or in the language 
within which CLIPS is being embedded. An external function can 
be used on both the LHS and the RHS of rules. Data can be pas.sed 
to and from them. 

These functions are defined under usrfuncs either in file 
raain.c (in interactive mode) or any other user file (in embedded 
application). Within usrfuncs a call should be made to the 
define - function routine for every function, the user wants 
CLIPS to know about. 



30 


Example 

usrf uncs ( ) 

{ 

def unction ( "fun" , ‘s’, fxan, "fun); 

define_f unction ( "dummy" , ‘ i ’ , my_dummy , ”my_duimny“ ) ; 

/* Additional define function statements could go here. 

} 

In define_f unction: 

First argument is a CLIPS name, a string 
representation of the name that will be used inside 
CLIPS rule. 

Second argument is a return type as 'i’=integer, 
'f’=float, 'c’= character, 's’=pointer to a character 
string, ‘w’=pointer to a character word, ‘u’=pointer 
to an unknown data type. 

Third argument is a pointer to the actual function (an 
external declaration of the function may be 
appropriate). The first argument need not be same as 
the actual function name, and 

Fourth argument is a string representation of the 
actual function name. The name must be exactly the 
same as the function name, namely the third argument. 

User defined functions are searched before system functions 
and if it matches with one of the defined functions already 
provided, the user function will be executed in its place. 
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a. Passing variable from CLIPS bo exbemal funcbion CLIPS 
actually calls the function without any arguments, though they 
are listed directly following a function name inside CLIPS rules. 
Instead the parameters are stored internally by CLIPS and can be 
accessed by calling the functions: 

Int nui»_arg ( ) ; 
char rstring (arg); 
float rfloat (arg); 
int runlcDown (arg); 
int arg; 

A call to nuin_arg will return an integer telling how many 
argument. s the function was called with. 

A call to rstring returns a character pointer, and rfloat 
returns a floating point number. The parameters have to be 
requested one at a time by specifying the parameter position 
number as the argument to rstring or rfloat. If the type of the 
argument is unknown, runknown can be called to determine the 
type. 

b. Passing data from external function to CLIPS An external 
function can pass data into CLIPS in two ways. It can return a 
value or can assert a new fact directly into the CLIPS fact 
list. If the external function is to be used as predicate, it 
must return a floating point number^ otherwise it can be a 
character, integer, word or unknown. Return values can be used 
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as predicates, bound to variables, or captured via pattern 
expansion. The return value does not have to be captured, but 
roust be defined in CLIPS, and all external functions must return 
a value. 

The other method of passing data, asserting a new fact 
directly into the CLIPS fact list, is done by calling the C 
function assert. 

Example 

FACT * assert (string): 

char * string ; 

Here string is a single string made up of floating point 
numbers and words. The return value is a pointer to a variable 
of type FACT. 


£]aPSl.i oh sail Resulting Fact 

(from external function) (in CLIPS fact-list) 

assert (“Earn is a boy") (Ram is a boy) 

CLIPS provides some more advanced interface functions in 
•which passing known variable types, accessing multifield 
variables, and building facts by scratch (to assert lots of 
facts) are possible (Culbert, 1987). 

3.3.2 Embedded application: 

a. General CLIPS is designed to be embedded within other 
programs. The embedding program can be a 'C’ language program or 
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a FORTRAN program or an Ada program. In each case a main program 

is provided by the user which calls CLIPS like any other 

subroutine. The basic changes which are to be made to access 

CLIPS from main program in a different language are: 

1. Object version of main.c is removed from the link list of 
programs given in the CLIPS manual (Culbert, 1987); 

2. Prior to loading rules, CLIPS is to be called by calling the 
function init_clips 

3. If CLIPS calls any external function, user functions must be 
defined . 

4. Many of the capabilities which are available in interactive 

interface are available by proper function calls 

(Culbert, 1987). 

5. If the embedding main program is in ‘C’ it must include two 
include statements, namely 

# include <stdio.h> 

# include "<clips.h>" 

6. If the embedding language is other than 'C' two modifications 
are to be made. They are: 

i. A special function must be defined to run CLIPS from FORTRAN 
(for Ada it is not needed). This function is run_clips and it 
converts the rule firing parameter before calling the CLIPS 
run function (vide Appendix A). 

ii. Since each language and each machine passes parameters 
differently, and since the source code of CLIPS is written in 
‘C’ language, every string passing from languages other than 
'C' should be converted to a *C’ string and vice versa. 
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In the present study CLIPS has been embedded within a 
FORTRAN main program and so the FORTRAN-CLIPS interaction is 
dealt within greater details in Sec. 3.3,2. Since the package 
has been implemented on microVAX II only, the machine dependent 
features are not discussed. The commands for linking and 
execution for VAX -VMS environment are given in Appendix B, 

b. Fortrcin - CLIPS interaction 

For complete language mixing four basic capabilities are 
needed 

1. A program in another language may be used as the main program 
and CLIPS can be called as needed for reasoning. 

2. Facts can be asserted into CLIPS from other languages 

3. CLIPS may call other functions written in any language from 
the RHS of a rule and may pass parameters to the function. 

4. In languages which can provide a meaningful return value, 
external functions may be called from the LHS of a rule (i.e. 
used as predicate function). 

The main program written in FORTRAN initialises CLIPS, loads 
the rule files, resets the process, asserts the facts and runs 
the program by calling specialised CLIPS functions. The 
functions are; int_clips, load_rules, reset_cllps, assert, 
ruii_clips in a sequence. In order to load a rule file or to 
assert a fact the Fortran strings are to be converted into C- 
strings by calling a function storec. Similarly for reverse 
action, i.e., to pass a parameter from CLIPS to Fortran, the C- 
string is to be converted into Fortran by calling a function 


loadc. 



Facts can be asserted to CLIPS from Fortran either as 
constraints or as variables. In both cases it must be a string 
(see Examples 1 and 2) 

example 1 Asserting a constraint fact: 

Fianctlbn iiall A sse r t s In ilae CLIPS fact-list 

FACTl = 'Ram is a boy’ 

CALL STOREC (FACTl, C_FACT) 

CALL ASSERT (C_FACT) (Ram is a boy) 

example 2 Asserting a variable fact: 

Ean gMon s&ll A sser t s is ills CL IP? fact -list 

100 FORMAT (E 10.4) 

X=4.5 

WRITE (INT_JFILE. 1000 )X 

FACT1= 'X is * // INT_FILE(1:10) 

CALL STOREC (FACTl, C_FACT) 

CALL ASSERT (C_FACT ) (X is 4.5) 

The argument in storec function FACTl is declared as integer 
and C_FACT as a character. 

The FORTRAN program which interacts with CLIPS is listed in 
Appendix C. 

Furthermore, to call a function from CLIPS which is not 
defined for CLIPS, it is necessary to define an external function 
or simply a function depending upon the purpose, viz., whether it 
is being called from CLIPS, or simply some parameters are to be 
passed from CLIPS to the function. An external function can also 
be defined (Subsec. 3.3.1) so that it can be used in two ways, 
vis., as a C external function or as a FORTRAN external function. 
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In order to pass the parameters to an external function a 
C Ifttsr fa cg rgvitins is to be defined. The C-interface routine 
and usrfuncs both are put in the file runclips.C along with 
defining the function run_clips (vide Appendix A). 

3.4 Other Features 

CLIPS can be implemented on a number of machines and a number 
of special functions can be defined internally or externally or 
both. Again I/O routing system provided in CLIPS is quit flexible 
and allows a wide variety of interfaces to be developed and easily 
attached to CLIPS. Simple window system, batch system, and 
dribble system are some of the I/O routers that can be 
incorporated in CLIPS (Culbert, 1987) . 



CHAPTER IV 


AN ES FOR FREQUENCY ANALYSIS OF CONTINUOUS HYDROLOGIC DATA 

4 . 1 Frequency Analysis of Hydrologic Daia 

4-1.1 Introduction 

Hydrology is a natural science which deals with many complex 
processsi^ associated with the hydrologic cycle. Hydrologic data 
observed from historical natural hydrologic phenomena are the only 
sources of information upon which quantitative hydrologic 
investigations are generally based. The collection of hydrologic 
data have been continuously expanding and are in ever increasing 
amount. However, the majority of hydrologic processes in nature 
and their outcomes are affected by a number of causative 
factors. They may also be stochastic by nature and governed by 
laws of chance. The various hydrologic variables that describe 
these random processes are therefore also governed by laws of 
chance and are random by nature. 

The hydrologic analysis dealing with these random variables 
may be carried out by applying either a deterministic approach or 
a stochastic approach or a combination of these two. 

Deterministic approaches in hydrology apply principles and laws of 
physical sciences such as fluid mechanics, thermodynamics etc. 
However, it is not possible to include all or even a sufficient 
number of causative factors in deterministic cause and effect 
relationships in hydrology. Therefore, deterministic approaches 
in hydrology, e.g., dynamic flow equation, suffer from major 
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disadvantages of being highly subjective, losing some of the 
information and having no associated probability level (Yevjevich, 
1972; Kite, 1977). Moreover a function of a random variable is 
also a random variable and so to draw inferences from observed 
data statistical and probabilistic methods are to be applied. 

Probability theory is the mathematical discipline that deals 
with the measure of chance or likelihood of a random variable 
while statistics deals with computation of statistical measures 
asing data which are observations and/or measurements (Chow, 
1964). Hydrologic data can suitably be expressed in statistical 
terms and be treated with probability theory as they are highly 
erratic and commonly stochastic in nature. Statistical analysis 
.nterprets the hydrologic observations and attempts to extract 
laximum information from them. Then the hydrologic information 
an be presented in condensed form as graphs, tables of n-umbers and 
lathematical techniques can be applied for decision making in 
ater resources planning, conservation, development and control. 

An important problem in hydrology deals with interpreting a 
ast record of observations of hydrologic variables in terms of 
deir future probabilities of occurrence. These variables can be 
Loods, droughts, storages, rainfalls, water qualities, waves etc. 
elution to this problem may be sorted out in two ways either 
’aphically or statistically. The plotting of the observations of 
le variables/the number of occurrences with respect to ranges of 
;lues generally produces some kind of pattern. This pattern may 
used to extend the available data to estimate the design value, 
a large number of observed data are available covering a period 
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of record much larger ■than "the return period for design, the 
design value can be estimated directly from the sample. The 
graphical processes are simple, visually presentable and assume no 
prior distribution. However, generally it is not possible to 
obtain such a long length of data and furthermore the procedure is 
highly subjective, empirical, nonunique and therefore not 
compatible with other phases of engineering design. 

In statistical method, frequency analysis is used to fit a 
theoretical probability distribution function (pdf) to the sample 
data. Statistical parameters of the pdf are estimated and in turn 
they are used to estimate the design value. This method takes 
care of the stochastic behaviour of hydrologic processes upto some 
extent and is more compatible with various phases of engineering 
and design. 

4.1.2 Problems in frequency analysis of hydrologic data 

There is no general agreement among hydrologists on the 
specific choice of any particular theoretical distribution for 
frequency analysis of a given hydrologic variable at a given site. 
Frequency analysis, being a data dependent technique, is subjected?- 
to many constraints, limitations and assumptions also. Therefore 
the solution proposed often incorporates heuristics based on 
subjective judgment, experience, and common sense. The 
reliability of frequency analysis of hydrologic data depends upon 
many factors. Some of them are discussed as follows: 

a. Data Frequency analysis of hydrologic data is based on certain 
assumptions about the input data. In routine analysis the data 
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are assumed to be consistent, homogeneous and independent. 
However in actual practice these assumptions are seldom 
satisfied , 

In fact, the maximum floods are seldom, if ever, measured due 
to uncertainty in occurrence time and lack of proper gauging 
facilities at the proper time and place. As a result, they are 
estimated from the extrapolation of rating curves and other 
techniques. These estimates cause inconsistency in data with 

possible large errors. Due to urbanisation, industrilisation , 

A 

changes in instrumentation and observation, and construction of 
new hydraulic structures for flood protection or irrigation, the 
data become nonhomogeneous . Nonhomoginity may also result from 
the fact that the observations taken for a particular hydrologic 
variable may be due to more than one distinct cause and effect 
relationships. Again, for example if two flood peaks resulting 
from the same meteorological disturbances are recorded, the 
assumption of independence in the hydrologic data becomes 
questionable . 

Other errors resulting from data are due to random and 
systematic measurement errors and due to missing data. Missing 
data are often estimated by correlation and regression analysis of 
available data at other sites and/or other hydrologic variables. 
Proper knowledge of the transferability and similarity of the 
involved processes is required for such interpolations. 
Otherwise, the statistical analysis may be adversely affected, 
often severely. 


b- Variable Type Mostly hydrologic variables are continuous but 



the observations are, due to the least count of the instrument and 
sampling in time, discrete values. This causes loss of 
information. Some rare events occurring in the intervals between 
observations may be excluded from the analysis due to their non- 
availability , and this can seriously affect the reliability of the 
frequency analysis. 

c. Sample size In most cases of hydrologic variables, the 
underlying processes are of geologic age during most of which no 
records have been collected. Also it is not possible to obtain a 
sample of very large length due to economic and other constraints. 
Statistical parameters may be calculated from a sample of finite, 
too small length. These estimates are then utilised to fit a 
theoretical distribution to observed data. The associated 
uncertainties can be evaluated (Stendinges, 1980) using knowledge 
derived from subjective judgment and from past experience. The 
appropriate choice of sample sizes for different events are site 
and events specific, and are often based on common sense. 

d. Variations: Some hydrologic variables occur in clusters of 
very low or very high values. For example, a particular season 
may be having very small monthly precipitation while another 
season^ may have large values. In this case average monthly 
precipitation of each year may give a misleading result. These 
clusters are to be identified and then described statistically. 
Otherwise the probability of monthly values being in an interval 
around an average monthly value, may differ from the probability 
of those particular monthly values belonging to well defined 
clu 


jjters of very low or very high value (Yevjevich, 1972). 


e. Identification of outliers: An outlier in a set of data is 
defined as an observation or subset of observations which appears 
to be inconsistent with the remainder of that set of data. The 
inconsistency can be interpreted as the observation being either 
sigTiif icantly higher (outlier) or lower (inlier) at the high end 
compared to the values indicated by the rest of the data. If the 
observations is significantly higher or lower than expected in the 
low end, it will be termed inlier or outlier, respectively. 
Presence of these outliers or inliers seriously distort the 
estimation of distribution parameters. For example, the 
estimation of skew parameters in probability distribution such as 
log Pearson type 3 is greatly affected by the presence of low 
outliers. Since the appearance of outliers may be due to several 
reasons, e.g., application of inappropriate distribution function 
to historical data, nonstationarity of the time sequences of major 
floods (as, long term trends), abnormal diverse causes of 
rainfall, inaccuracy of measurements etc. , it is not very easy to 
correctly identify them. 

Attempts have been made with little success for detection of 
outlierg and inliers (Singh, 1987; WRC, 1981) but still it is a 
matter of subjective judgment of the analyst that whether he 
identifies some observations for special scrutiny in a 
conventional flood frequency analysis apparently having outliers. 

f . Specific problem in fitting a pdf: Apart from the factors! 

I 

mentioned above there are many more constraints. e.g., 
nonstationarity with respect to the process involved etc. , which! 
are very difficult to be taken care of by mathematical rules. Two 
basic questions need to be answered while fitting a particular 


distribution to a given set of hydrologic data. These questions 
may be stated as: 

1. Which of the many distributions available is the 'true’ 
distribution for the population ? and 

2. Which are the most appropriate methods of parameter estimation 
and goodness of fit test ? 

The validity of the first question is due to four factors: 

i. The data may follow a particular distribution 
naturally. 

ii. Sample data available are usually for relatively 
low return periods { i . e , , around the center of 
probability distribution) while the one required to 
be estimated is generally of larger return period 
(i.e. in the tail region of the distribution). 

iii. If the sample is derived from a population of mixed 
distributions one particular pdf will not fit to 
it. 

iv. Many distributions have similar shape in their centre 
but differ widely in the tails. Thus, it is possible 
to fit several distributions on the sample data and 
end up with several different estimates of a T year 
event . 

Second question arises due to fact that a number of 
statistical parameter estimation methods and goodness of fit tests 
are available, none of which has an absolute superiority or 
universal validity. For example, parametric and nonparametric 



goodness of fit tests are based on different assumptions about the 
statistics of population (Kite, 1977) and thus the reliability of 
these tests also depends upon the validity of these assumptions. 

These problems, if not dealt with proper attention and with 
an intelligent knowledge rich heuristic approach, will often lead 
to misleading results. Hence domain specific human expertise can 
be effectively utilised to address these problems. 

4.1.3 Expert knowledge related to frequency analysis of 
continuous hydrologic variables 

Frequency analysis of hydrologic data is a vast domain. 
Being in use .from years it has gathered some amount of expertise 
in terms of heuristics. This section deals with some of the 
expertise, available for frequency analysis of continuous 
hydrologic variables in general and flood frequency analysis in 
particular. Some basic ideas which leads a hydrologist to reach a 
final conclusion about selection of a particular type of 
distribution, can be listed as: 

a. Seasonality, stationarity and normality of data series: 
Seasonal variations must be considered in the selection of pdf. 
To take care of seasonality, different distributions which may be 
suitable for that season only should be fitted. Nonstationarity 
of data with respect to time and space can be detected by 
recognising the trends of data using time - series analysis and 
by observing changes in environmental conditions. 

b. Outliers: Presence of outliers should be tested right in the 
beginning of the analysis. If the frequency plots show the 



possibility of the presence of outliers, standard tests (Singh 
1987: Chow et al , 1988) should be adopted. Test for inliers and 
outliers for a normal distribution (Table 4.1) should be conducted 
and if they are present then data should be transformed to normal 
distribution perhaps using power transformation (Singh, 1987) and 
method of least squares is applied for parameters estimation. If 
the series is an exceedence series then data are transformed to 
negative exponential series but method of parameters estimation 
remains the same, i.e., method of least squares. 

c. Mixed distribution: There is always a chance of confusion 
_a.bO(xt combination of two distinct distributions i-_e- mixed 
distribution. A simple investigation about the causes of a 
particular hydrologic event and a frequency plot can provide a 
rational answer. If it is a mixed distribution the frequency 
analysis can follow the procedures discussed in Singh (1974) and 
Hawkins (1974). 

d. Concept of parsimony: Yevjevich (1972) as well as Haan (1977) 
recommended the selection of pdf having not more than two or three 
parameters. From mathematical point of view the more the number 
of parameters, the better it will fit a set of data. However, if 
number of parameters is more, reliability of estimated parameter 
becomes less. Thus an optimisation between flexibility of 
distribution and reliability of parameters should be achieved. 
Other things being equal, smaller number parameters are to be 
estimated leading to their parsimony. 

e. Suitability of distribution for different data Selection of 
pdf largely depends upon the type of hydrologic variable, vis., 
flood, drought, rainfall etc. For low flows or droughts, extreme 
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Test values of outlier and inlier departures. 


e 

SL 

Outllar 

lull*! 

Lov 1 
15-100 

Low 2 
10-100 

Low 3 
15-100 

Low 4 
30-100 

Low 5 
___ .40-100 

1 

<0.01 

>0.99 

Inlier 

Outlier 

<-0.689 

>1.029 

<-0.495 

>0.643 

<-0.412 

>0.498 

<-0.363 

>0.418 

<-0.327 

>0.368 

2 

<0.05 

>0.95 

Inlier 

Outlier 

<-0.532 

>0.681 

<-0.369 

>0.421 

<-0.303 

>0.337 

<-0.264 

>0.285 

<-0.237 

>0.253 

3 

<0.10 

>0.90 

Inlier 

Outlier 

<-0.441 

>0.503 

<-0.299 

>0.321 

<-0.243 

>0.254 

<-0.211 

>0.217 

<-0.188 

>0.193 

4 

<0.20 

>0.80 

Inlier 

Outlier 

<-0.318 

>0.297 

<-0.209 

>0.197 

<-0.167 

>0.159 

<-0.143 

>0.137 

<-0.127 

>0.123 

5 

<0.30 

>0.70 

Inlier 

Outlier 

<-0.221 

>0.161 

<-0.141 

>0.112 

<-0.110 

>0.092 

<-0.093 

>0.081 

<-0.082 

>0.073 

6 

<0.40 

>0.60 

Inlier 

Outlier 

<-0.132 

>0.052 

<-0.080 

>0.043 

<-0.060 

>0.037 

<-0.050 

>0.034 

<-0.043 

>0.032 




High 1 
15-100 

High 2 
20-100 

High 3 
15-100 

High 4 
30-100 

High 5 
40-100 

1 

<0.01 

>0.99 

Outlier 

Inlier 

<-1.054 

>0.679 

<-0.654 

>0.488 

<-0.511 

>0.407 

<-0.429 

>0.358 

<-0.377 

>0.323 

2 

<0.05 

>0.95 

Outlier 

Inlier 

<-0.683 

>0.529 

<-0.433 ' 
>0.369 

<-0.341 

>0.300 

<-0.290 

>0.263 

<-0.256 

>0.235 

3 

<0.10 

>0.90 

Outlier 

Inlier 

<-0.500 

>0.438 

<-0.322 

>0.299 

<-0.258 

>0.241 

<-0.221 

>0.209 

<-0.195 

>0.186 

4 

<0.20 

>0.80 

Outlier 

Inlier 

<-0.295 

>0.317 

<-0.197 

>0.209 

<-0.161 

>0.166 

<-0.139 

>0.143 

<-0.124 

>0.126 

5 

<0.30 

>0.70 

Outlier 

Inlier 

<-0.159 

>0.221 

<-0.112 

>0.140 

<-0.094 

>0.110 

<-0.082 

>0,093 

<-0.074 

>0.082 

6 

<0.40 

>0.60 

Outlier 

Inlier 

<-0.051 

>0.132 

<-0.043 

>0.079 

<-0.039 

>0.060 

<-0.035 

>0.050 

<-0.032 

<0.043 


Notes: 15-100, and 40-100 denote the range of sample size n in 

years; * denotes window; SL significance level 



value type 3 or Pearson type 3 distributions (Kite, 1977) will be 
suitable. For floods lognormal, Pearson type 3 and extreme value 
type 1 and extreme value type 3 distributions are suitable. For 
rainfall, extreme value type 1 distribution is preferable (Chow, 
1964). Similarly for exceedence series, negative exp'onential 
distribution is well suited. 

f. Parameter ranges for different distributions: Statistical 

parameters like descriptors of central tendency, dispersion and 
peakedness are most commonly used for preliminary selection of 
pdfs. 

Hydrologic variables are generally asymmetrically 

distributed. Therefore, selection of the proper set of 
statistical parameters are important for identifying the "true" 
pdf. Generally arithmetic mean is believed to be the best 
central value parameter. The coefficient of skewness often acts 
as regional isation parameter and becomes very important when mean 
and standard deviation have small variation. The coefficient of 
peakedness or kurtosis is also useful for determining the best fit 
pdf. Often a combination of skew and kurtosis determines the pdf 
type. A guide to the selection of pdf in terms of kurtosis and 
skewness of the sample data is given in Fig 4.1 (Raudkivi, 1979). 
Some criteria for the initial selection of pdf based on estimated 
sample statistics are presented in Table No. 4.2. 

The following notations are used: 

^ - population mean 
er = population standard deviation 
Cg = coefficient of skewness for population 
CV = = coefficient of variation 

Cjf = coefficient of kurtosis 



1 

Un iform 
distribu- *2 
tion 

Normal 


2 


3 


Cs 

Regions in plane for various d; 

after E.S. Pearson. From Hahn and Shap; 
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Table 4.2 


Theortical and Heuristic Parameters Ranges 
for Different Frequencies Distributions. 


case Theortically 

nos . 

Heuristical ly 

1 - - - 

Suggestion 

1- C = 0.0 

-0.5 < C < 0.5 

I 

s I 

s 


CV < 0.4 

same 


C, > 1.0 
k 

same 


and 



i- if < 2.0 

if C, < 2.25 

k 

Uniform distribution 

ii . if C, =3.0 
k 

if (2.0 < C. < 4.0 

V, ^ 

Normal distribution 

iii. if > 3.0 

if 4.0 < C|^ < 10.0 

t distribution 

2« Cg < 0.0 

Cg ^-0.5 

Pearson type III or 

Lognormal type III 

can be fitted. Prior- 
ity may be given to 

Pearson type III on 

the basis of parsi- 
mony of parameters. < 

3’ C > 0.0 

C > 0.5 


s 

s 


i. C, <(1.25C +1.0) 

k s 

same 

No distribution, 

(impossible region) 

or C, < 1.0 
k 



ii . C, =20+3.0 

k s 

1.25C +3<C, <2.250 +3 
s k s 

Lognormal II 

distribution 


contd 
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V 

vi 

Note 

1 


ii. C = 1.14 
s 

and C, = 5.4 
k 

iv. C, = 3.0+6CV 
k 

Where, CV = 1/y~N 


V. C = 4.0 
s 

C, = 9.0 
k 

Vi. (1.06250^+1. 75 )< 

C, <(1.81250 +1.75) 
k s 

ii. (0 +1.0) < 
s 

0, <(1.06250 +1.75) 
k s 

0 

ii. (1.250 +3.0) < 

5 

0, <(1.81250 +1.75) 
k s 

s Alternatively 
. Input data can be 
transformed data 
or Ext^rr^kl type 


0.64 <0 <1.64 

s 

Extreme value type 

4.5 < < 6.5 

I distribution 

2+60 V < < 4+60 V 

Gamma distribution 

Where, 


^ -0.2<0V< ^ +0.2 

iTir -/~w 

1 


3.5 < Cg < 4.5 

Exponential 

8 < 0, <10 
k 

distribution 

same 

Beta (j) 


distribution 

same 

Beta (u) 


distribution 

same 

Beta or Gamma 


distribution 


log transformed to a Y series and then 
can be tested for Normal , Pearson type 111 
1 disributions. 



g. Method of parameter estimation: Theoretically, method of 
maximum likelihood estimate is the best method (Chow et al, 1988) 
for parameters estimation if the sample size is more than 25. 
However, if outliers are present ^ method of least squares or 
special WRC procedures (Chow et al, 1988) must be used for the 
parameter estimation. Stedinger (1980) has discussed about 
specific methods of parameter estimation for lo.gnormal 
distribution based on length of data and coefficient of Skewness. 
Similarly specific methods for parameter estimation for Pearson 
type 3 (hall, 1987), and Log Pearson type 3 (Rao, 1987) have been 
formulated. 


h. Transformations: Haan (1977) suggests that transformations 
are useful in the case of bounded distributions. Transformations 
are applied to normalise the data. Depending upon the coefficient 
of skewness a particular type of transformation is chosen. For a 
positively skewed data log transformation or n^ root 

transformations is needed. For negatively skewed data inverse 
Pearson or n root transformation is needed. Two step power 
transformation (Gupta, 1989) is the most powerful transformation. 


Though this discussion does not provide an answer to all 
questions and uncertainties raised earlier, it clearly suggests a 
set of rule based procedures that can be utilised in the frequency 
analysis of continuous hydrologic variables. It clearly states 


some of the associated problems and suggests some solution 

central library 

* *■ T , KAN‘Pua 


procedures . 


Acc. No A. . U 

An extension of these rules, based on 'Ih&re-— vigorous 


evaluation and specific expertise of knowledgeable experts in this 



field can be introduced as knowledge based for solving these 
problems. This can actually lead to a comprehensive expert system 
for frequency analysis of hydrologic data. 

4 . 2 Structure of FACHVKS Program 

4.2.1 Introduction 

Frequency Analysis of Continuous Hydrologic Variables 
Embedding Expert System (FACHVES) program is an interactive menu 
based FORTRAN program with an embedded expert system to provide 
decision making support for frequency analysis of continuous 
hydrologic variables. The program consists essentially of four 
parts, a main program, subroutines, ES interface and a rule based 
knowledge base. 

4.2.2 Environment 

FACHVES programs have been developed, implemented aind tested 
in VAX - VMS environment. However it can be implemented on HP 
9000-800 UNIX environment or on IBM PC with slight modifications. 

4.2.3 Main program 

The main program of FACHVES is a menu based program 
(Fig. 4.2) consisting of one master menu and eight submenus. 

The master menu can call any of the eight submenus depending 
upon the option of the user. The submenus are basically for 
preliminary analysis, method of analysis, choice of distribution, 
transformations, goodness of fit test, plotting of results and 
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Fig. 4.2 Schefcatic Flow Chart of Wain Program 



















ES interface. Each submenu consists of several options and 
depending upon user’s choice, the desired subroutine can be 
called. However, the user is supposed to create an input file or 
give input data interactively before calling the master menu for 
the first time. Options for seasonal data analysis is displayed 
in the master menu only when the data is seasonal. A listing of 
the main program is given in Appendix D. 

4.2.4 Subroutines 

In FACHVES several subroutines already available in FORTRAN 
for statistical analysis and some subroutines, e.g., for two step 
power transformation have been incorporated. They can be called 
either from main program or from other subroutines, as per 
requirement. The subroutines are basically for 

a. fitting different distributions; 

b. applying different goodness of fit tests; 

c. transforming the data using different transformations; 

d. using different methods of analysis; 

e. linking FORTRAN program to CLIPS, and 

f. converting strings from FORTRAN to C and vice versa. 

The subroutines are programmed in such a way that they 
perform one or a combination of the above functions. The 
limitations in using a subroutine, if any, are conveyed to the 
user interactively during execution. Table 4.3 lists the major 
subrouting used in frequency analysis. 


One of the subroutines PTHELP acts as ES interface and it has 



Tabl« 4 .. 3 




List of Main Subroutines 


Subroutine 
PAR AM 
ND 
LN2 

LN3 

PT3 

LP3 

TIE 

T3 

FAP 


LOGTRAN 

IPT 

SQTRAN 

TSPT 

MLS 


Function 

Statistical parameter calculation 
Fits a normal distribution 

Fits a lognormal £ distribution and estimates T year 
events 

Fits a lognormal 3 distribution and estimates T year 
events 

Fits a Pearson type 3 distribution and estimates T year 


events 

Fits a lognormal distribution and estimates T year 
events 

Fits a Type 1 Extremal distribution and estimates 
T year events. 

Fits a Type 3 Extremal distribution and estimates 
T Year events. 

Fits different distributions for seasonal data, 
calculates seasonal variations and tests the 
goodness of fit (Chi-square) 

Trnsforms the data logarithmically 

Transforms the data by Inverse Pearson transformation 
Transforms the data by square root transformation 
Transforms the data by two step power transformation 
Fits a normal distribution using method of least 


square 


contd 
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SER 


CHISQRT 

KOLMG 

PTHELP 

STOREC 

LDADC 

ADVICE 


Computation of the standard errors of events computed 

from various probability distributios compared to the 

observed event magnitude 

Chi-square test 

Kolmogrov— smi rnov test 

ES interface ; calls ES program 

Converts a FORTRAN string in to C string 

Converts a C string in to FORTRAN string 

Passes ES advice to main program 



been incorporated as an optional calling subroutine from FORTRAN 
main program. The subroutine PTHELP links the main program with 
the CLIPS function calls and string conversion function STOREC as 
specified in Subsec. 3.3.2 (b). The suggestions given by ES is 
converted to FORTRAN strings, by LOADC function and stored in a 
subroutine ADVICE which subsequently passes them to the main 
program . 

4 . 2.5 Knowledge base: The main function of the ES is to 
supplement the computational procedures specified in the main 
program with a number of advices for choosing the right 
alternatives for analysing the data. Without the ES knowledge 
base (KB) similar task could have been accomplished, however, only 
by going through an exhaustive process of all the available 
alternatives. The ES knowledge base helps to eliminate some of 
the non-feasible alternatives of the pdf fitting exercise. It 
also incorporates subjective judgments inbuilt in the KB, or 
provided by the user, to choose the right path of evaluation. 

The ES is embedded in the main program and accessed by it 
during execution to accomplish the following tasks: 

i . provide an interface for user supplied information in the 
decision making process. 

ii. transfer appropriate parameters to the FORTRAN main 
program, based on the advice of the ES, 

iii. advice on the adequacy of the data for statistical 
analysis 

iv. advice on the initial choice of distribution functions 
based on estimated values of statistical parameters . 

V. advice on the suitable choice of transformations based on 
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thte estimated values of statistical parameters, and 
vi.if the user feels he has outliers, then advice on the 
method to deal with outlier in the data. 

The rule based ES’s knowledge base is composed of a set of 
rules that reflects expert knowledge in this field. It also 
provides a vehicle for incorporating subjective judgment in the 
selection of procedures for statistical analysis, while guiding 
the user through various stages of decision making. As this study 
is of a preliminary nature with the goal of developing a framework 
for an ES for frequency analysis of continuous hydrologic 
variables, the rules developed are only exploratory in nature and 
are certainly not complete. The flow chart showing the various 
components and functions of the ES knowledge base is shown in Fig. 
4 . 3 . 


Knowledge base is accessed during the execution of the main 
program. The type of inputs provided to the knowledge base 
directly from the main program includes the estimated values of 
the statistical parameters of the raw hydrologic data, and the 
length of the record. Based on these inputs, the ES guides the 
user through a series of steps, where existing patterns (rules) in 
the KB, are matched with the patterns constructed through user 
input and given input information. Matching or nonmatching of 
these patterns leads to either a set of parameters or control 
variables being passed on to the main program, or the testing of a 
subsequent rule. 

Once all the existing rules compatible with the facts, 
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Flf. 4.3 cJurl tf Knovltds* hit if ES. 
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asserted by the user or generated by the actions of rules have 
been tested, all the parameters and control variables as specified 
by the ES are transferred to the main program. Also a list of 
suggestions on the subsequent procedures of statistical analysis 
are displayed by the ES, to help the user. The ES knowledge base 
can be accessed by the user from the main program by using either 
the master menu or the submenu preliminary analysis, whenever 
user needs some advice. 

4 . 3 Data Used 

The data used to test the developed program are of two 
types; seasonal (monthly data along with monsoon, nonmonsoon and 
annual) and nonseasonal data. Two sets of seasonal data have 
been tested to verify the feasibility of this approach. These 
are: 

A. Non seasonal data 

a. Annual peak discharge of river Narmada (1951-1982) 

b. Annual peak discharge of river Mahanadi at Hirakud(1946- 

1982) 

B. Seasonal data 

In seasonal data, monthly data along with monsoon, 
nonmonsoon and annual data have been considered as seasons for 
convenience of analysis. Therefore, 12 months and 3 seasonal sums 
in a year have been considered. The Hirakud flows for 1946 to 
1982 have been used in the analysis. 


The data used in the analysis are presented in Table 4.4. 
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Input Data 


THE ANNUAL PEAK FLOOD DISCHARGE AT KOETAKKA FOR NARMADA (1951~1S82) 


28.1.7.0 

11127. 0. 13631.0.19521. 0.33915. 0.20746. 0.1 1982. 0.25023. 0.31804. 0.16135.0, 

23438.0. 18591.0.11338.0.19690.0.31604.0.27935.0.41891.0.18101.0.47651.0, 

54063.0. 36562.0.33278.0.17713.0.24354.0.29564.0.26232.0.22751.0.25662.0, 

16602.0 


HIRAKUD 

FLOUS(ANNUAL) IN THM 

(1947-82) 



37,1,7,0 

60B9.BO 

4771.50 

4773.40 

4271.20 



4344.30 

5AA2.10 

4667.90 

3419.30 

2037.10 

3837.20 

4271.40 

2413.20 

4425.30 

4645.00 

4253.20 

9377.60 

2058.90 

3890.20 

5500.50 

1617.90 

2275.90 

4121.60 

2662.80 

2638.10 

4632.90 

4913.10 

2501,80 

5449.00 

1821.20 

4611.50 

4283.30 

2708.20 

3226.40 

2482.00 

3927.20 

3689.40 

1306.90 


HIRAKUD FLOWS IN THM (1946-82) 
555,15,11,0 


289.5 

1304.6 

3054.5 

880.7 

327.7 

100.4 

48.0 

25.9 

25.9 

19.0 

9.9 

3.7 

5857.0 

232.8 

8.5 

1030.0 

1745.7 

1499.7 

338.7 

. 54.6 

31.0 

31.1 

15.5 

8.5 

5.0 

3.2 

4622,6 

148.9 

60.6 

769.4 

2005.4 

773.8 

943.7 

104.8 

55.4 

24.1 

17.5 

9.3 

4.9 

4.5 

4552.9 

220.5 

62.5 

652.6 

1864.2 

927.8 

495,1 

150.7 

45.5 

21.2 

16.6 

22.6 

9.4 

3.0 

4002,2 

269.0 

83.0 

1172.4 

2049.1 

719.6 

107.3 

43.0 

25.0 

17.7 

11.0 

14.6 

99.5 

3.1 

4131.4 

212.9 

36.9 

226.8 

1413.8 

405.8 

243.0 

62.8 

22.3 

10.3 

7.5 

5.7 

2.1 

5.1 

2326.3 

115.8 

70.2 

805.8 

1483.7 

1874.0 

257.6 

79.7 

34.1 

36.3 

18.5 

4.0 

2.3 

1.7 

4491.3 

176.6 

22.2 

991.0 

1555.8 

580.7 

169,0 

68.0 

- 19.2 

9.2 

1.7 

0.8 

0.7 

1.0 

3318.7 

100.6 

35.0 

222.9 

527.6 

1030.1 

170.7 

40.0 

3.2 

3.6 

1.6 

0.8 

0.8 

0.8 

1986.3 

50.8 

99.9 

755.9 

911.0 

1287.0 

617.1 

102.3 

21.1 

17.1 

14.0 

5.9 

3.2 

2.7 

3670.9 

166.3 

303.0 

1414.7 

1618.5 

534.7 

245.3 

74.1 

10.8 

11.8 

10.7 

25.1 

13.8 

8.9 

4116.2 

155.2 

12.0 

593.8 

1236.9 

387.6 

41.3 

28.7 

21.3 

16.4 

27.1 

44.9 

1.6 

1.6 

2271.6 

141.6 

44.4 

1433,2 

886.2 

1240.9 

599.0 

111.4 

43.1 

28.1 

17.2 

6.3 

10.3 

5.2 

4203.7 

221.6 

30.7 

625.5 

1816.7 

1708.9 

273.0 

72.3 

40.3 

28.8 

10.0 

19.6 

9.3 

9.9 

4454.8 

190,2 

62.8 

797.9 

2219.7 

495.0 

453.9 

76.8 

40.9 

32.0 

40 . 1 . 

13.4 

12.3 

8.4 

4029.3 

223.9 

675.7 

2762.1 

1896.3 

2990.9 

719.4 

150.6 

86.6 

41.9 

19.9 

16.7 

8.9 

8:6 

9044.4 

333.2 

59.7 

406.1 

753.5 

555.0 

131.2 

* 61.9 

34.3 

16.1 

15.2 

19.9 

2.5 

3.5 

1905.5 

153.4 

96.3 

611.9 

1352.3 

1370.2 

265.6 

93.7 

40-9 ‘ 

21.7 

12.1 

17.2 

3.4 

4.9 

3696.3 

193.9 

165.9 

1566.6 

2168.0 

863.0 510.1 

113.7 

19.9 

29.2 

11.8 

26.0 

18.8 

7.5 

5273.6 

226.9 

45.2 

389.2 

385.6 

558.8 

113.3 

43.1 

23.9 

21.2 

16.6 

6.8 


11.2 

1492.6 

125.3 


6089.8 
4771.5 

4773.4 

4271.2 

4344.3 

2442.1 

4667.9 

3419.3 

2037.1 

3837.2 

4271.4 
2413..2 

4425.3 
4645.0 

4253.2 
9377.6 

2058.9 

3890.2 

5500.5 

1617.9 


31.2 
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4 . 4 Results and Discussions 


The developed program can be used in two ways : 

a. Analysis for annual data (if data are nonseasonal ) 

b. Season by season analysis (if data are seasonal) 

A. Analysis for annual data 

Analysis of two sets of data have been done and the 
interactive session with the program for one of them has been 
reported completely in Table 4.5. 

a. Analysis for annual peak discharge 

The statistical analysis for Narmada river shows that data 

are positively skewed (0.959) and coefficient of kurtosis is 

4.0343. The KBES suggests a distribution which can be a Pearson 

type 3 or log Pearson type 3 or log normal type 3. The suggestion 

is also confirmed by flow chart of KBES (Fig 4.3) and Table 4.2. 

However, a Pearson type 3 distribution has been selected. Chi- 

2 

square statistics of 2,998 for 3 degree of freedom ( )^ 95^, 3 

DOF=7.81) indicates that a distibution with parameters shown in 
Table 4.5 is a good fit at 95% confidence level. Table 4.6 shows 
the whole session and the fitted distribution. 

Similar analysis for Mahanadi river at Hirakud indicates a 
positively skewed data (1.15) having coefficient of kurtosis 
6.612 and the KBES gives similar suggestion. However a Pearson 
type 3 distribution has been selected. Chi-square statistic of 
5.165 for 3 degree of freedom indicates that the distribution 
with parameters shown in Table 4.6 is a good fit at 95% 
confidence level. The fitted distribution is shown in Table 4.6. 

b. Season by season analysis 

Season by season analysis has been presented for three 
seasons and the interactive session with the program for one of 



64 


Table jLi! 

Interactive Session I 

(For annual data of peali discharffes of river Narmada) 


A MASTERS THESIS OF O.P.HISHRA AA 
A UNDER THE GUIDANCE OF AA 
A Dr .S.Ramaseshan and Dr.B.Dutta AA 
A at I. I. T. Kanpur AA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
A WISH YOU HAPFY COMPUTING AA 
A WITH AA 
A FACHVES AA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
A A— A FREQUENCY-ANALYSIS WITH EXPERT ADVICE A— A 
^ ^ 


AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 


A INPUT MODULE A 

^ * 


Do you want to sive input on the screen? 

If yes, Type 1 ,If no, Type 2 
If you want to quit Type 3 

2 

--The input file should be nawed as lEST.INP 
It must have followins : 

A a TITLE with forttat 80A1 
AND, unf or Bated 
A NX — Number of data 

A NS — Number of seasons 

(for annual series NS =1) 

A NPT — Number of points 

NPTsNCLASS+1 where NCLASS is number of 
classes in which data can be divided 
for chisquare test- The class frequency 
should be equal to 5 
(It can be 7/11/13) 

A Observational hydrolosic data RX(I) 

(Not more than 600; 

seasonsdn row)/years(in column) 

Have you sot such an input file ? If yes, type 1 
If you want to quit ,type 2 
If you want to go to previous menu type 3 


1 

* ^ 

A MASTER-MENU A 

* : ^ 

A 1. PRELIMINARY ANALYSIS A 

A 2. CHOICE OF DISTRIBUTIONS A 

A 3. TESTS FOR GOODNESS OF FIT A 

A 4. PLOTTING OF RESULTS A 

A 5. QUIT A 

A 6. HELP FROM EXPERT SYSTEM INTERFACE A 

^ 


AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA"^ • 
Type the serial number of desired option 
1 

AAAAA AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 


A $n PRELIMINARY ANALYSIS $$$ A 

^ ^ 

A 1. STATISTICAL PARAMETERS A 

A 2. FREQUENCY PLOT A 

A 3.C.D.F.PL0T A 

A 4. TRANSFORMATIONS A 

A 5. HELP FROM EXPERT SYSTEM INTERFACE A 

A 6. TO MASTER MENU A 

^ ^ 


Type the serial number of desired option 



1 

«EAN= 25382.2852 
VAR= 0.114G1E+0S 
SKEW= 0.95990E+00 
£TDEV= 0.1070GE+05 

KURTOSIS= 0.40343E+01 AUG,DEVIATION= 0.82410E+04 
THIS ANALYSIS IS FOR LOGTRAHSFORHED DATA 
MEAN= 10.0593 
OAR= 0.17221E+00 
SKEW= 0.96092E-02 
STt!EV= 0.41498E+00 
KURTOSIS= 0.28240E401 
XKIN= 0.93171E+01 X«AX= 0.10898E+02 
MEAN, AND, STDEO FROM SCREEN 

10.0593 

0.41498 

FOR ORIGINAL DATAAj^AA 
CHECK FOR OUTLIERS BY WRC METHOD 
XMIN= 0.11127E405 XMAX= 0.540G3E+05 
No. of hish outliers = 0. 

No. of law outliers= 0. 
AA*AAA**AAA*AA***A*AA****A*A*AjlcaA*AA***** 

A ii$ PRELIMINARY ANALYSIS i%% A 

^ — ^ 

A 1. STATISTICAL PARAMETERS A 

A 2. FREQUENCY PLOT A 

A 3.C.D.F.PL0T A 

A 4. TRANSFORMATIONS A 

A 5. HELP FROM EXPERT SYSTEM INTERFACE A 

A 6. TO MASTER MENU A 

^ ^ 

Type the serial nuBiber of desired option 
5 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
A $$ EXPERT SYSTEM INTERFACE $$ AA 

* ^ 

A l.FOR PRELIMINARY ADVICE A 

A 2. FOR SECONDARY ADVICE A 

A 3. FOR EXPERT ANALYSIS OF RESULTS A 

A 7. TO MASTER MENU A 

— ^ 

Which help do you want? 

Type the serial nunber of desired option 
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1 

f-0 (initial-fact) 

f-1 (start) 

f-0 (initial-fact) 

f-1 (start) 

f-2 (Bean 25382) 

f-3 (skew 0.95990002) 

f-4 (kurt 4.03429985) 

f-5 (stdev 1070S) 

f-6 (length of data 28) 

WELCOME 

TO 

F A C H V E S 

This expert system is designed for single 

peaked continuous hydrologic variables 

IF YOU DO NOT UNDERSTANT THE QUESTION ; Type nk 

Type of data type the first letter (e/a/s) 

Whether it is exceedence OR annual ?3 
Do you think that outliers are present? no 
THIS ANALYSIS IS FOR LOGTRANSFORMED DATA 
HEAN= 10.0593 . 

VAR= 0.17221E+00 
SKEW= 0.96092E-02 
STDE0= 0.41498E+00 
KURTOSIS^ 0.28240E+01 
IMSN=tDe98iriEi»iOiypMMAXan0aifl886fif«£iori of 
logtransforaed data 70.41498 

You can fit 

1. a pearson type 3 distribution 

2. a logpearson type 3 distribution 

3. a lognomal 3para«eter distribution 
Gamnia distribution is well suited 

l.You can fit either one OR all available distributions 
PROCESS IS BEING STOPPED; 

RESET THE PROCESS IF YOU WANT TO CONTINUE 
WITH OTHER SET OF DATA 
Do you want any help ? no 

Suggestions passed should be taken in sequencial order 

f-0 (initial-fact) 

f-1 (start) 

f-2 (Bean 25382) 

f-3 (skew 0.95990002) 

f-4 (kurt 4.03429985) 

f-5 (stdev 10706) 

f-6 (length of data 28) 

f-7 (type of data a) 

f-8 (process continue yes) 

f-9 (data is large enough) 

f-10 (outliers are no) 

f-11 (positive skew) 

f-12 (stdev of logtran data 0.41497999) 

f-13 (logtran val is 2.49346423) 

f-14 (suggestion 15) 

f-15 (esad PT3,LP3,LN3) 

f-16 (suggestion 10) 

f-17 (esad 6D) 

f-18 (stop) 

f-19 (help no) 

Now you are out of clips 
Value passed * 15.00000 

Name passed is = 

PT3,LP3yLN3 


only 



* * 

A HASTER-MENU A 

- ^ 

* 1. PRELIMINARY ANALYSIS A 

A 2. CHOICE OF DISTRIBUTIONS A 

A 3. TESTS rOR GOODNESS OF FIT A 

A 4. PLOTTING OF RESULTS A 

A 5. QUIT A 

A 6. HELP FROM EXPERT SYSTEM INTERFACE A 

^ ^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
VALUE PASSED IS 15.00000 
NAME PASSED IS 
PT3,LP3,LN3 


Type the serial nurober of desired option 
2 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 


A *$ METHOD OF ANALYSIS %% A 

^ ^ 

A 1. METHOD OF MOMENTS A 

A 2. METHOD OF MAXIMUM LIKEIYhOCI A 

A 3. METHOD OF LEAST-SQUARES A 

A 4. TO MASTER MENU A 

* ^ 

Uhich method do you like? 


Type the serial number of desired option 
1 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AA %% CHOICE OF DISTRIBUTION $$ AA 

* ^ 

A 1. NORMAL A 

A 2. LOG-NORMAL (2-PARAMETER) A 

A 3. LOG-NORMAL O-PARAMETER) A 

A 4. PEARSON TYPE 3 A 

A 5.L06-PEARS0N TYPE-3 A 

A 6. EXTREME-VALUE TYFE-1 A 

A 7. EXTREME-VALUE TYPE-3 A 

A 8. FAP— PROGRAMS A 

A 9. TO MASTER MENU A 

A 10. TRANSFORMATIONS A 

A 11. METHOD OF ANALYSIS A 

A- A 

If you want to transform the dataAATYPE-lO 

If not, then which distribution? type the serial nunber 

4 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 


A S$ TEST OF GOODNESS OF FIT $$ • AA 

A A 

A 1. CHI-SQUARE TEST A 

A 2.K0LM0G0R0V-SMIFN0V TEST A 

A 3. TO PLOTTING RESULTS A 

A 4.C0MPARISI0N OF STANDARD ERROR A 

A OF ESTIMATE FOR DIFFERENT A 

A DISTRIEUTIONS A 

A 5. TO MASTER MENU A 

A 6. HELP FROM EXPERT-SYSTEM INTERFACE A 

A A 

Type the serial number of desired option 
1 



CHI'SQUAFE STATISTIC 
AAAAkAAAAkAiMhkAHAAA 


2.998593 

NO. OF DEGREES OF FREEDOM 
AAAAAAAAAAAAAAAAAAAAAAAAAA 

3 

kAAAAAAAAAAAAAA.AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 


AA II PLOTTING RESULTS A 

A A 

A l.FREOUENCY PLOT A 

A 2.C.D.F-PL0T A 

A 3. TO MASTER MENU A 

A— A 


Type the serial number of desired option 


I * 

A MASTER-MENU A 

^ ^ 

A 1. PRELIMINARY ANALYSIS A 

A 2. CHOICE OF DISTRIBUTIONS A 

A 3. TESTS FOR GOODNESS OF FIT A 

A 4. PLOTTING OF RESULTS A 

A 5. QUIT A 

A 6. HELP FROM EXPERT SYSTEM INTERFACE A 

* ^ 


AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
VALUE PASSED IS 15.00000 
NAME PASSED IS 
PT3,LP3,LN3 

Type the serial number of desired option 
5 

FORTRAN STOP 

I 


PEARSON TYPE 3 DISTRIBUTION 
METHOD OF MOMENTS 


ALPHA 

BETA 

GAMMA 


0.66981E+04 

0.25547E+01 

0.82707E+04 


Ml 

H2 

SKEW 


0.25382E+05 

0.11461E+09 

0.12513E+01 


T, YEARS 2 5 10 20 50 100 

X 0.23245E+05 0.33094E+05 0.39632E+05 0.45825E+05 0.53713E+05 0.59543E+05 


T 

S 0.13833E+04 0.2415GE+04 0.37362E+04 O.57524E+04 0.90896E+04 0.11960E+'05 
T 
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Table 

Fitted Distributions to Different Sets of Data 


. HIEAKUD FLOWS (ANNUAL) 


PEARSON TYPE 3 DISTRIBUTION 
HITHOD OF ?1CKENTS 


ALPric 

0.1083EE+04 


Ml 

0.37935E+04 


BETA 

0.19977E+01 


M2 

0.23466E+07 


GAMMA 

0.16283E+04 


SKEW 

0.14150E+01 


T, YEARS 2 

5 

10 

20 

50 

100 

X 0.34519E+04 

0.48603E+04 

0.5e229E+04 

0.6748SE+04 

0.79428E+04 

0.BB342E+04 

i 

S C.18859E+03 

0.32250E+03 

0.48730E+03 

0.75913E+03 

0.12242E+04 

0.16304E+04 


T 


.HIRAKUD FLOWS (MONSOON) 


THREE PARAMETER LOGNORMAL DISTRIBUTION 
METHOD OF MOMENTS 




MEAN OF X 
VARIANCE OF X 
SKEW OF X 

A 


0.36383E+04 

0.21487E+07 

0.11436E+01 

-0.37778E+03 



T, YEARS 

2 

5 

10 

20 

50 

100 


X 0.33948E+04 0.47026E+04 0.55580E+04 0.&37ieE+04 0.74219E+04 0-82113E+a4 
T 

S 0.27421E+03 0.35519E+03 0.47614E+03 0.70147E+03 0.n334E+04 0.15421E+04 
T 


CoYiteLt 



HIRAKUD FLOWS (NONMONSOON) 


THREE PARAMETER LOGNORMAL DISTRIBUTION 
MAXIMUM LIKELIHOOD PROCEDURE 


TRIAL A F(A) 


1 

2 

3 

4 

5 

6 

7 

8 
9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 
21 
22 

23 

24 

25 

26 


0.34464E+02 

0-25506E+02 

0.13168E+02 

~0.29256E+01 

-‘0.22773E+02 

-0,45801E+02 

-0.70666E+02 

-0-94548E+02 

'0.11371E+03 

-0-12250E+03 

“0.12561E+03 

-0,12628E+03 

-0-124(^2E+03 

~0.12584E+03 

-0.12560E+03 

-0.12445E+03 

-0.12574E+03 

~0.12495E+03 

-0-12413E+03 

-0.12586E+03 

-0-12538E+03 

-0.12600E+03 

-•0.12644E+03 

-0.12407E+03 

~0.12553E+03 

-0.12553E+03 


-0.14010E+00 

-0.61615E“01 

“0.26492E-01 

-0.11117E-01 

-0.45460E-02 

-0.18072E-02 

“0.69714E-03 

-0.25356E~03 

-0.B6248E”04 

-0.20146E-04 

-0.52452E-05 

-o.ioisaE-os 

0.24438E-05 

-0.19073E-05 

0.35763E-06 

0.17285E-05 

-0.20266E'05 

0.11921E-05 

0.12517E~05 

-0.27418E-05 

0.71526E-06 

-0.95367E-0S 

“0.65565E-06 

0.34571E-05 

-0.23246E-05 

O.OOOOOE+00 


A 

MEAN OF LN(X-A) 
VARIANCE OF LN(X-A) 
SKEW OF LN(X«A) 


-0.12553E+03 

0.56953E+01 

0.43202E-01 

0.30518E-03 


FOR GOOD USE OF THIS DISTRIBUTIONSKEW OF LN(X-A) SHOULD BE CLOSE TO ZERO 
T>YEARS 2 5 10 20 50 100 


X 0.17195E+03 0.22881E+03 0.26274E+03 0.29320E+03 0.33034E+03 0.35692E+03 
T 

S 0.10948E+02 0,14112E+02 0.18349E+02 0.24095E+02 0.33449E+02 0.41559E+02 
T 


them has been shown in Table 4.7. 


The statistical analysis for the data for monsoon season at 
Hirakud indicates positively skewed data with coefficient of 
kurtosis 6.891. The KBES suggests Pearson type 3, log Pearson 
type 3 and lognormal type 3. However Chi-square statistic for 7 
degrees of freedom comes 37.019 for Pearson type 3 distribution 
and 38.07 for log Pearson type 3. For lognormal type 3 this value 
is 13.387. Therefore the distribution selected is lognormal type 
3 which indicates a good fit at 95% confidence level (){-_, >7D0'F= 
14.1) . 

Similar analysis for nonmonsoon season indicates lognormal 
type 3 distribution to be a good fit with Chi-square statistic 
4.508 at 95% confidence level with 7 degrees of freedom. 

However for the month of the January the data is found to 

o 

have an approximate 2ero/(0.0529) , coefficient of kurtosis as 
2.68 and coefficient of variation 0.401. The KBES suggests a 
normal distribution for the season. The session output i.o shown 
in Table 4.7. Chi-square statistic of 1.73 for 7 degrees of 
freedom indicates that the fitted distribution is a good fit at 
95% confidence level. 

4 . 5 Conclusions 

The test runs of the program FACHVES with the above sets of 
data shows that the KBES is very helpful arriving at correct 
decisions based on informations available either from FORTRAN 
program or user for fitting a probability distribution to a given 
set of data. It also establishes the feasibility of the approach 
to combine a statistical computational process with the heuristic 
capabilities of an expert system. However the program is to be 
improved signif icantly if it were to be powerful expert system. 



Table 4.7 

Interactive Session II 

(For seasonal data for the flows in January in river Mahanadi) 


A HASTENS THESIS OF O.P.HISHRA AA 
A UNHEP THE GUIUANCE OF AA 
A Dr .S. Raipaseshan and Dr.B.Dutta AA 
A at I. I. T. Kanpur AA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
A WISH YOU HAPPY COMPUTING AA 
A WITH AA 
A FACHVES AA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
A A— A FREQUENCY-ANALYSIS WITH EXPERT ADVICE A— A 


AAAAAAAAAAAAAAAAAAAAA AAAAAAAAAAAAAA AAAAAAA 
A INPUT MODULE A 
A A 


Do you want to 3 ive input on the screen? 
If yes, Type 1 ,If no, Type 2 
If you want to quit Type 3 

2 


— The input file should be naaed as TEST.INP 
It iust have following : 

A a TITLE with format 80A1 
AND, unforaated 
A NX — Nuaber of data 

A NS — Nuaber of seasons 

(for annual series NS =1) 

A NPT — Nuaber of points 

NPT=NCLASS+1 where NCLASS is nuaber of 
classes in which data can be divided 
for chisquare test. The class frequency 
should be equal to 5 
(It can be 7/11/13) 

A Observational hydrologic data RX(I) 

(Not aore than 600; 

seasonsCin row)/years< in column) 

Have you got such an input file ? If yes, type 1 
If you want to quit ,type 2 
If you want to go to previous menu type 3 

1 

For which season do you want to do the analysis ? , 

Type the serial no of the season 


8 

THIS COMPUTATION IS FOR SEASON NO. .8 

A A 

A MASTER-MENU A 

* ^ 

A 1. PRELIMINARY ANALYSIS A 

A 2. CHOICE OF DISTRIBUTIONS A 

A 3. TESTS FOR GOODNESS OF FIT A 

A 4. PLOTTING OF RESULTS A 

A 5. QUIT A 

/ A 6. HELP FROM EXPERT SYSTEM INTERFACE A 

A 7. ANALYSIS FOR OTHER SEASONS A 

^ ^ 


AAAAAAA AAAAAAA AAAAAAAAAAAAAA AAAAAAA AAAAAAAAAAAAAA 
Type the serial number of desired option 


I 


A 11$ PRELIHINARY ANALYSIS Hi A 

^ ^ 

A 1. STATISTICAL PARAHETERS A 

A 2. FREQUENCY PLOT A 

A 3.C.D.F.PL0T A 

A 4. TRANSFORMATIONS A 

A 5. HELP FROM EXPERT SYSTEM INTERFACE A 

A 6. TO MASTER MENU A 

A A 


Type the serial nu»ber of desired option 
1 

MEAN= 22.0135 

VAR= 0.780G3E+C2 
SKEU= C.5296SE“01 
STIiE'v'= C.8S354E4C1 

Kl!RTCSIC= 0.2685CE+01 AUG.DEVIATION= 0.71443E+0 
THIS ANALYSIS IS FOR LOGTRANSFOrMED DATA 
MEAN= 4.6867 

UAR= 0.53217E+01 
SKEW= 0.43273E-01 
STDEV= 0.23069E4C] 

KURTOSIS= 0.>9095E40) 

XMIN=-0.35667E+00 XMAX= 0.91461E401 

MEAN, AND, STDEV FROM SCREEN 
4.6867 

2.3069 

FOR ORIGINAL DATAAAAA 
CHECK FOR OUTLIERS BY WRC METHOD 
XMIN= 0.36000E+01 XMAX= 0.41900E+02 
No. of high outliers = 0. 

No. of low outliers= 0. 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 


A Hi PRELIMINARY ANALYSIS i%% A 

^ ^ 

A 1. STATISTICAL PARAMETERS A 

A 2. FREQUENCY PLOT A 

A 3.C.D.F.PL0T A 

A 4. TRANSFORMATIONS A 

A 5. HELP FROM EXPERT SYSTEM INTERFACE A 

A 6. TO MASTER MENU A 

* ^ 

Type the serial number of desired option 
5 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
A %% EXPERT SYSTEM INTERFACE H AA 

— ^ 

A l.FOR PRELIMINARY ADVICE A 

A 2. FOR SECONDARY ADVICE . A 

A 3. FOR EXPERT ANALYSIS OF RESULTS A 

A 7. TO MASTER MENU A 

* ^ 

Which help do you want? 


Type the serial number of desired option 


1 

f-0 (initial-fact) 

f-1 (start) 

f-0 (initial-fact) 

f-1 (start) 

f-2 (»pan 22.01399994) 

f-3 (skew 0.052969) 

f-4 (kurt 2.68580008) 

f-5 (stdev 8.83539963) 

f-6 (lensth of data 37) 

WELCOME 

TO 

F A C H V E S 

This expert systeit is designed for single 

peaked continuous hydrologic variables only 

IF YOU DO NOT UNDERSTANT THE QUESTION ; Type nk 

Type of data type the first letter (e/a/s) 

Whether it is exceedence OR annual ?5 
Do you think that outliers are present? no 
NOKMAL DISIRtEUIION IS WELL SUITED 

l.You can fit one distribution for all seasonsjOR 

2. You can fit different distributions for different 
seasons ;0R 

3. You can fit all available distributions for all seasons 
PROCESS IS BEING STOPPED; 

RESET THE PROCESS IF YOU WANT TO CONTINUE 
WITH OTHER SET OF DATA 
Do you want any help ? no 

Suggestions passed should be taken in sequencial order 

f-0 (initial-fact) 

f-1 (start) 

f-2 (Bean 22.01399994) 

f-3 (skew 0.052969) 

f-4 (kurt 2,68580008) 

f-5 (stdev 8.83539963) 

f-6 (length of data 37) 

f-7 (type of data s) 

f-8 (process continue yes) 

f-9 (data is large enough) 

f-10 (outliers are no) 

f-11 (suggestion 6) 

f-i2 (esad ND) 

f-13 (stop) 

f-14 (help no) 

Now you are out of clips 
Value passed = 6.000000 

Name passed is = 

ND 


^ ^ 

A MASTER-MENU A 

jj. ^ 

A 1. PRELIMINARY ANALYSIS A 

A 2. CHOICE OF DISTRIBUTIONS A 

A 3. TESTS FOR GOODNESS OF FIT A 

A 4. PLOTTING OF RESULTS A 

A 5. QUIT A 

A 6. HELP FROM EXPERT SYSTEM INTERFACE A 

A 7. ANALYSIS FOR OTHER SEASONS A 

3 ^ 


AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
VALUE PASSED IS 6.000000 
NAME PASSED IS 
ND 

Type the serial number of desired option 
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2 


A i$ METHOn OF ANALYSIS ft A 

^ * 

A 1. METHOD OF MOMENTS A 

A 2. METHOD OF MAXIMUM LIKELYHDOD A 

A 3. METHOD OF LEAST-SQUARES A 

A 4. TO MASTER MENU A 

A-^ A 

Which iiethod do you like? 


Type the serial number of desired option 
1 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AA ft CHOICE OF DISTRIBUTION tt AA 


^ ^ 

A 1. NORMAL A 

A 2. LOG-NORMAL (2-PARAMETER) A 

A 3. LOG-NORMAL (3-PARAMETER) A 

A 4. PEARSON TYPE 3 A 

A 5.L0G-PEARS0N TYPE-3 A 

A 6.EXTREME-0ALUE TYPE-1 A 

A 7. EXTREME-VALUE TYPE-S A 

A 8. FAP— PROGRAMS A 

A 9. TO MASTER ilENU A 

A 10. TRANSFORMATIONS A 

A 11. METHOD OF ANALYSIS A 

* ^ 


If you want to transforw the dataAATYPE-lO 

If not, then which distribution? type the serial number 

B 

It fits normal distribution with or withouttransformations 

K0UNT=1 — for normal distribution 

K0UNT=2 — for inverse pearson transforiiation 

K0UNT=3 — for squareroot transformation 

K0UNT=4 — for log-normal distribution(MLE) 

K0UNT=5 — for pearson distribution 
K0UNT=6 — for log-tran5formation(M0M) 

K0UNT=7 to quit from fap 

What is yours? 

K0UNT= 

1 

WISH YOU GOOD LUCK 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 


A ff TEST OF GOODNESS OF FIT ft AA 

^ 

A 1. CHI-SQUARE TEST A 

A 2.K0LMO6OR0V-SMIRN0V TEST A 

A 3. TO PLOTTING RESULTS A 

A 4.COMPARISION OF STANDARD ERROR A 

A OF ESTIMATE FOR DIFFERENT A 

A DISTRIBUTIONS A 

A 5. TO MASTER MENU A 

A 6 . HELP FROM EXPFRT-SYSTEM INTERFACE A 

A A 

Type the serial number of desired option 



CKi>£ouAKr sTAiirnr 


KG. OF FFGFFFS OF Ffr-rOf' 
kkkkkkkkkkkkkkkkkklkr ^ ^ a k 

1 

kkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk 
kk it PLOTTING RESULTS k 

k k 

k 1. FREQUENCY PLOT k 

k 2.C.D.F.PL0T k 

k 3. TO HASTER HENU k 

^ ^ 

Type the serial number of desired option 
3 

* ^ 

* hASTER“hENU k 

k k 

k 1. PRELIMINARY ANALYSIS k 

k 2. CHOICE OF DISTRIBUTIONS k 

k 3. TESTS FOR GOODNESS OF FIT * 

* 4. PLOTTING OF RESULTS k 

k 5. QUIT 

k G.HELP FROM EXPERT SYSTEM INTERFAr: 

^ ^ 

kkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk 
VALUE PASSED IS 6.000000 
• NAME PASSED IS 
ND 

Type the serial number of desired option 
5 

FORTRAN STOP 

$ 

ANALYSIS FOE NORMAL DISTRIBUTION 
kkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk 

SEASONAL AVERAGES 
kkkkkkkkkkkkkkkkkk 

22.014 

SEASONAL STANDARD DEVIATION 
kkkkkkkkkkkkkkkkkkkkkkkkkkkk 

8.835 

SEASONAL COEFFICIENT OF SKEWNESS 

0.053 

CHI-SQUARE STATISTIC 
AAAAAAAAAAAAAAAAAAAAA 

1.723 

NO. OF DEGREES OF FREEDOM 
AAAAAAAAAAAAAAAAAAAAAAAAAA 



CHAPTER 6 


SUMMARY, CONCLUSIONS , SUGGESTIONS 


5.1 Summarsr 

Expert. Systems are becoming very popular in several 
knowledge -rich domains and therefore also in water resources 
engineering. They are applied either to solve a problem making 
use of computerised human expertise; or to assist a problem 
solving procedure by proper advice based on heuristic expert 
knowledge, especially to help non experts solve a domain 
specific problem. 

Out of several expert system tools CLIPS has been chosen 
the present study for its ability to communicate with exter 
programs written in other high level languages, enhan 
capability of handling numerical values, easy installati 
economy in terms of memory requirements and compatibility w 
several machines available for the study. 

Frequency analysis is an importauit step in engineer 
design and planning, particularly in water resources engineeri 
where estimation of future hydrologic phenomena is based on p 
observations. Being a data dependent technique the reliability 
frequency analysis is always questionable. To fit a theoreti 
probability distribution function to a set of observations and 
draw conclusions therefrom, requires a combination of rout 
numerical/statistical computations and human expertise in the f 
of heuristics. 
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In order to address these problems a program, Frequency 
Analysis of Continuous Hydrologic Variables with an embedded 
Expert System (FACHVES) has been developed. This program was 
developed in FORTRAN for microVAX-VMS environment. It consists of 
methods for computerised statistical analysis supported by an 
expert decision support system. The framework consists of four 
parts: main program , subroutines , ES interface, and a knowledge 

base. 


The developmental work completed so far is preliminary in 
nature and it provides the framework for further development of a 
more comprehensive computer program package. Two data sets from 
field were tested for evaluating the performance of the program 
developed so far. 

Due to easy portability of CLIPS, C, and FORTRAN this package 
can be easily transported to other systems such as UNIX HP-9000 or 
IBM PC. 

S . 2 Conclusions 

Frequency analysis of hydrologic data is a knowledge 
domain where lot of decisions are to be taken based on human 
intuition , experience , and subjective judgments. 

The developed package for FACHVES was tested with two 
iifferent data sets representing recorded streamflow data at two 
lifferent sites in India. The results of this testing show the 
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proper functioning of the package at this stage of development. 
These results also established the feasibility of this approach 
for statistical analysis of the hydrologic data by a combination 
of computational procedures and expert advice . 

Suggestions For Future Study 

A huge amount of expert knowledge is required and 
ivailable for frequency analysis of hydrologic data. This 

ssentially preliminary study incorporates only a small fraction 
f these methods and expertise. Substantial additional study is 
ecessary to make the developed package suitable for practical 
se. In particular, the following tasks are necessary: 

1 . Further enhancement of the knowledge base of the ES 

2. Addition of other computational methods for 
statistical analysis 

3. Enhancement of the graphical capabilities; and 

4. Implementation of the package in other computing 


environments . 
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APPENDIX A 


♦include 'clips-h* 
runnel ips ( liKit) 
int Alimit; 

< 

run(Aliiit) ; 

> 

c_3dvice( ) 

< 

float num; 
char Anafte; 
nuBi=rf loat (1 ) ; 
naBe=rstrin 9 ( 2 ) ; 
advice( inu»,naine) ; 
return(O) ; 

> 

extern int parameter () ; 
usrfuncs( ) 

-C 

def i ne_f unction ( 'parameter • , " ^parameter , 'parameter ' ) 
define function( 'advice' , advice, 'c^advice' ) ; 

y - 
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APPENDIX - B 
VAX-VMS Linking Commands 

1 ) copy all the CLIPS Include files to your directory: 

Scopy [{CLIPS master di rectory} J*.h [(your directory}} 

2) create an object file from runclips.c: 

$cc runclips.c 

3) compile the FORTRAN routine (s) 

Sforlran {FORTRAhf files} 

4) link all the files together. 

$llnk/execul8ble={exec name} {FORTRAN files}, runcllps, [{CLIPS 
master directory}] cllpsllb/llbrary 

Note that one of the FORTRAN programs must be a main program. The CLIPS library 
should include the loedc, and storec functions. 
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APPENDIX C 

ES Interface 28-Kov-1990 09:33:56 VAX FORTRAN 

28-NOV-1990 09:33:27 iriISK2:[JK.MISHFA3FDRCLIP.F0I<;35 


C EXPERT SYSTEH INTERFACE 

C AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

SUBROUTINE PTHELP(A,SK,ST,V, AK,N) 

CHARACTER A50 FACTl, FACT2,FACT3,FACT4, FACTS, INT^FILE, FILE 
INTEGER C„FACT,C_FILE,K,CC FACT,CCC FACT,CCF_^FACT,CCN FACT 
1000 F0RHAT(E12.5) 

2000 FORMAT (IS) 

IF ( KKK . NE . 1 ) CALL RESET^CL IPS 
CALL INIT^CLIFS 
FILE= 'FACHVES.CLP^ 

CALL STOREC(FILE,C FILE) 

K=LOAri__RULES(C_FILE) 

CALL RESET_CLIFS 

CALL DISFLAYFACTSO 

U'FITE(IK'T„FILE,1000)A 

FACTl = 'Bie3n V/INT_.FILE( 1 : 12) 

CALL ST0REC(FACT1,C_FACT) 

CALL ASSERT (C^^FACT) 

WRITE(INT_FILE,1000)SK 
FACT2='skew V/INT FILE(1:12> 

CALL ST0REC(FACT2,CC FACT) 

CALL ASSERTCCC FACT)” 

WRITE(INT_FILE^1000)AK 
FACT3=^kurt V/INT_FILE(1:12> 

CALL STQREC(FACT3, CCCJACT) 

CALL ASSERKCCC^FACT) 

WRITEdNT FILE,1000)ST 
FACT4=^stdev V/INT_FILE(1 : 12 ) 

CALL ST0REC(FACT4, CCF^FACT) 

CALL ASSERT(CCF FACT) 

WRITE(INT_FILE,2000)N 

FACT5='ler,gth of data V/INT FILE(1:5) 

CALL ST0REC<FACT5, CCN FACT) 

CALL ASSERT (CCN_FACT) 

CALL DISFLAYFACTSO 
k=RUN^CLIPS(“l) 

CALL DISFLAYFACTSO 

TYPE A, "Now you are out of clips" 

RETURN 

END 



28'“Nov-1990 09:33:56 
28-NOV-1990 09:33:27 


VAX FORTRAN V4. 6-244 
$r2SK2:CJK.«ISHF:A3FORCLIP.FOR;3: 


0001 

c 



0002 

C 



0003 


SUBROUTINE STORECC CHARS, CSTPING) 


0004 


INTEGER CSTRING (1) 


0005 


CHARACTER A iA) CHARS 


0006 


IWORD =1 


0007 


IBIT = 0 


0008 


K = LENGTH (CHARS) 


0009 


DO 100 1=1, K 


0010 


IVALUE=ICHAR(CHARS(I:I)) 


0011 


CALL HVBIIS (IVALUE,0,8,CSTRIN0(IW0Rri),IEIT) 


0012 


IBn=IBII+8 


0013 


IFCIBIT -LI. 32)60 TO 100 


0014 


w 

i-i 

II 

o 


0015 


IW0RD=IW0RD +1 


0016 

100 

CONTINUE 


0017 


CALL HVBITS(0,0,8,CSTRING(IUORri),IBn) 


0018 


RETURN 


0019 


END 




28-NDV-1990 09:33:27 

4iaSK2:CJK.HISHRA3FORCLIF.FOE;35 

0001 

C 



0002 

C 

AAAAAAAAAAAAAAAAkAAAAA 


0003 

c 



0004 


INTEGER FUNCTION LENGTH(STRING) 


0005 


CHARACTER A (A) STRING 


0006 


K=LEN( STRING) 


0007 


DO 100 I=K,1,-1 


0008 


IF(STRING(I:I) -.NE.' ') GO TO 150 


0009 

100 

CONTINUE 


0010 

150 

CONTINUE 


0011 


LENGTH=I 


0012 


RETURN 


0013 


END 




28-NOV-1990 09:33:27 

il! ISK2 : C JK . M ISHRA3FORCL IP - FOR ; 35 

0001 

C 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 


0002 


SUBROUTINE LOADC (CHARS, CSTRING) 


0003 


INTEGER CSTRING (1) 


0004 


CHARACTER A 1 CHAR 


0005 

/ 

CHARACTER A iA) CHARS 


0006 


IU0RD=1 


0007 


IVALUE=0 


0008 


CHARS=' ' 


0009 


IBIT=0 


0010 


K=LEN (CHARS) 


0011 


DO 100 1=1, K 


0012 


CALL MVBITS(CSTRING(IW0RD),IBIT,8,IVALUE,O) 


0013 


IFdVALUE -LX. 32)G0 TO 200 


0014 


CHARS(I:I)=CHAR(IVALUE) 


0015 


IBIT=IBIT+8 


0016 


IFdBII.LT. 32)60 TO 100 


0017 


IBIT=0 


0018 


I«0Rri=I«0RD-H 


0019 

100 

CONTINUE 


0020 

200 

CONTINUE 


0021 


RETURN 


0022 


END 
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VAX FORTRAN V4. 6-244 
$I!ISK2:CJK.KISHRA3FOFCLIF.FOB; 


0001 c 

0002 

0003 

0004 

0005 

0006 
0007 
0006 

0009 

0010 

0011 c 

0012 c 

0013 

0014 


SUBROUTINE ADVIC£( VALUE, C^NAKE ) 

INEEGEr C NAME 

REAL VALUf 

CHARACTER ABO NAME 

character ACO c_c_rian'e 

comri.on/cl ret/v_v3lue , c_c riar.,e 

CALL LOA^;C(^A^fF,C_NAMF) 

C_C_-.3n:?=ri5-.F 

v_value=value 

TYPE A,'NAME=%KAME 

TYPE A, 'VALUED', VALUE 

RETURN 

END 


28“Nov-1990 09:33:56 
28-NOV-1990 09:33:27 


VAX FORTRAN V4. 6-244 
iIiISK2:£aK.HISHRA3FORCLIP.FOR;3 
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0001 

0002 

0003 

0004 

0005 

0006 

0007 

0008 

0009 

0010 
0011 
0012 

0013 

0014 

0015 

0016 

0017 

0018 

0019 

0020 
0021 
0022 

0023 

0024 

0025 

0026 

0027 

0028 

0029 

0030 

0031 

0032 

0033 

0034 

0035 

0036 

0037 

0038 

0039 

0040 

0041 

0042 

0043 

0044 

0045 

0046 

0047 

0048 

0049 

0050 

0051 

0052 

0053 

0054 

0055 

0056 

0057 


APPENDIX D 

Main Program 
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VAX FORTRAN V4. 6-244 
ir!lSK2:CaK.HISHRA9FORCLIP.FO 


C A TESTING PROGRAM 

REAL KURT 

CHARACTER A80 C„C NAME 
C0MM0N/BLACK1/X’<600),N,TITLE<80) 

C0MM0N/BL0CK1/NS,NPT,IALT 
COMMON/CL RET/V VALUE, C^C NAME 
C0HM0N/BL0CK2/RX(600),NX,HEADING(80) 

DIMENSION TX(600) 

TYPE A, 'AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA' 

TYPE A, 'A MASTERS THESIS OF O.P.MISHRA AA' 

TYPE A, 'A UNDER THE GUIDANCE OF AA' 

TYPE A, 'A Dr .S.RaBaseshan and Dr.B.Dutta AA' 

TYPE A, 'A at I. I. T. KANPUR AA' 

TYPE A,'AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA' 

TYPE A, 'A WISH YOU HAPPY COMPUTING AA' 

TYPE A, 'A WITH AA' 

TYPE A, 'A F A C H V E S AA' 

TYPE A, ' AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA' 

TYPE A, 'A A— A FREQUENCY-ANALYSIS WITH EXPERT ADVICE A— -A' 

TYPE A, 'A - A' 

II TYPE A, 'AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA' 

TYPE A, 'A INPUT MODULE A' 

TYPE A, 'A A' 

TYPE A, 'Do you want to sive input on the screen?' 

TYPE A, 'If yes, Type 1 ,If no, Type 2' 

TYPE A,' If you want to quit Type 3' 

READ(A,A)INP 

GO TO <111,19,9)INP- 

19 WRITE(A,A>'— The input file should be named-as TEST.INP' 

WRITE(A,A)' — It Bust have following :' 

TYPE A,'— A a TITLE with format 80A1 ' 

TYPE A,' — AND, unforaated ' 

TYPE A,' — A NX — Number of data ' 

TYPE A,' — A NS — NuBber of seasons ' 

TYPE A,' — (for annual series NS =1) ' 

TYPE A,'— A NPT— Number of points 

TYPE A,'— NPT=NCLASS+1 where NCLASS is number of' 

TYPE A,' classes in which data can be divided' 

TYPE A,' for Chisquare test. The class frequency' 

TYPE A,' should be equal to 5' 

TYPE A,'— (It can be 7/11/13) 

TYPE A,' — A Observational hydrologic data RX(I)' 

TYPE A,' — (Not Bore than 600;' 

TYPE A,' seasonsdn row)/years(in column)' 

TYPE A, 'Have you got such an input file ? If yes, type 1' 

TYPE A,' If you want to quit ,type 2' 

TYPE A,' If you want to go to previous menu type 3' 

READ(A,A)IC 
GO TO (110,90,11)IC 
no CALL INPUT 

TITLE(80)=HEADING(80) 

REWIND 20 
GO TO 112 

III WRITE(A, A) 'LENGTH OF DATA?' 

READ(A,A)NX 

WRITE(A,A)'DATA; X(I)=' 

READ(A,A)(RX(I),I=1,NX) 
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FOIcCilPIHAIN 

28-Ncv- 

■1990 09:33:56 

VAX FORTRAN V4. 6-244 



2e-Kov- 

■1990 09:33:27 

$BISK2:[JK.MISHRA3F0B:CLIF.F0P;3' 

0058 


1ALI=0 



0059 


«RITECA,*)'GIME NS, NPT' 



0060 


ypITEC A, A) 'NS--No. of seasons ,HPT-No of points' 



0061 


EEAI.(A,^.}KS,NPI 



0062 

112 

N=K'X/K£ 



0063 

113 

IFCNS.GT.DTHEN 



0064 


iilF:IT£(A, A) 'For which seascrs do you want to do the analysis ?' 


0065 


ypITSn,*) 'Type the serial no of the season' 



0066 


1EABC*,*)NC 



0067 


IF(NC.GI.MS)B0 TO 180 



0068 


ELSE 



0069 


NC=1 



0070 


END IF 



0071 


LK=(NC-1>*H 



0072 


BO 12 IJK=LK+i,LK+H 



0073 

12 

X(IJK-LK>=RX<IJK) 



0074 


IF(NS.GT.l)ypITE<^,lin)NC 



0075 

nil 

F0R«A1(4X,'THIS COHPUTATION IS FOP SEASON NO. ',51//) 



0076 

9 

TYPE A,'* — - 

-A' 


0077 


TYPE Af'A HASTEP-KENU 

A' 


0078 


TYPE 

-A' 


0079 


TYPE *,'A I.PREIIHINAPY ANALYSIS 

A' 


0080 


TYPE A, 'A 2. CHOICE OF DISTPIBUTIONS 

A' 


0081 


TYPE A, 'A 3. TESTS FOP GOCBNESS OF FIT 

A' 


0082 


TYPE A, 'A 4. PLOTTING OF RESULTS 

A' 


0083 


TYPE A, 'A 5. QUIT 

A' 


0084 


TYPE A, 'A 6. HELP FROH EXPEPT SYSTEM INTERFACE 

A' 


0085 


IF(NS.GT.1)THEN 



0086 


TYPE A, 'A 7. ANALYSIS FOR OTHER SEASONS 

A' 


0087 


ENBIF 



0088 


TYPE A, 'A 

•'A' 


0089 


TYPE A, ' AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA' 


0090 


IF(IES.NE.O)XHEN 



0091 


TYPE A, 'VALUE PASSED IS ',V VALUE 



0092 


TYPE A, 'NAME PASSED IS ',C C NAME 



0093 


ENBIF 



0094 


TYPE A, 'Type the serial nuiber of desired option' 



0095 


READ(A,A)MH 



0096 


IF<MM.E8.7>G0 TO 113 



0097 


60 TO (20,40,50,60,90, 100)MM 



0098 

20 

TYPE A,'AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA' 



0099 


TYPE A, 'A $$$ PRELIMINARY ANALYSIS %$% A' 



0100 


TYPE A, 'A A' 



0101 


TYPE A, 'A 1. STATISTICAL PARAMETERS A' 



0102 


TYPE A, 'A 2 -FREQUENCY PLOT A' 



0103 


TYPE A, 'A 3.C.B.F.PL0T A' 



0104 


TYPE A, 'A 4. TRANSFORMATIONS A' 



0105 


TYPE A, 'A 5. HELP FROM EXPERT SYSTEM INTERFACE A' 



0106 


TYPE A, 'A 6. TO MASTER MENU A' 



0107 


TYPE A, 'A — - A' 



0108 


TYPE A, 'Type' the serial nu«ber of desired option' 



0109 


IEAIiCA,A)IPA 



0110 


60 TO (2I,22,22,31,100,9)IPA 



0111 

21 

CALL PARAMCX,N,AMEAN, VAR, SKEW, KURT, STBEV) 



0112 


SO TO 20 



0113 

22 

60 TO 60 



OlH 

30 

TYPE A , ' AAA AAA AAAAAAAA A A AAAAAA A AAAAA AA AAA AAAAAAAA ' 





FORCLIPiHAIN 



28-NOV-1990 

09:33:56 





28-KOV-1990 

09:33:27 

0115 


TYPE A/AA ii CHOICE OF DISTRIBUTION 

%% 

AA' 


Oils 


TYPE A/ A 


A' 


0117 


TYPE A/A 1. NORMAL 

A' 



0118 


TYPE A, 'A 2. LOG-NORMAL (2-PARAMETER) 

A' 



0119 


TYPE A/A 3. LOG-NORMAL (3-PARAMETER) 

A' 



0120 


TYPE A, 'A 4. PEARSON TYPE 3 

A' 



0121 


TYPE A, 'A 5.L0G-PEARS0N TYPE-3 

A' 



0122 


TYPE A, 'A 6. EXTREME-VALUE TYPE-1 

A' 



0123 


TYPE A/A 7. EXTREME-VALUE TYPE-3 

A' 



0124 


TYPE A, 'A 8. FAP— PROGRAMS 

A' 



0125 


TYPE A, 'A 9. TO MASTER MENU 

A' 



0126 


TYPE A/A 10. TRANSFORMATIONS 

A' 



0127 


TYPE A, 'A 11. METHOD OF ANALYSIS 

A' 



0128 


TYPE A, 'A 


A' 


0129 


TYPE A,Mf you want to transfom the dataAATYPE-10' 


0130 


TYPE A, 'If not, then which distribution? type 

the serial number' 

0131 


]i£AD(*,*>IOC 




0132 


GO 10 (32, 33, 3-4, 35, 36, 37, 38, 39, 9, 31, 40)100 



0133 

32 

CALL ND 




0134 


60 TO 50 




0135 

33 

CALL LN2(METH0D) 




0136 


GO TO 50 




0137 

34 

CALL LN3(METH0D) 




0138 


60 TO 50 




0139 

35 

CALL PT3 (METHOD) 




0140 


60 TO 50 




0141 

36 

CALL LP3(METH0D) 




0142 


60 TO 50 




0143 

37 

CALL TIE(METHOD) 




0144 


60 TO 50 




0145 

38 

CALL T3E( METHOD) 




0146 


GO TO 50 




0147 

39 

URITE(A,A) ' It fits nor»al distribution 

with 1 

or without 


0148 


Itransfonations' 




0149 


CALL FAP 




0150 


GO TO 50 




0151 

31 

TYPE A, 'AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA' 


0152 


TYPE A, 'A TRANSFORMATIONS 


A' 


0153 


TYPE 


—A' 


0154 


TYPE A, 'A 1. LOG-TRANSFORMATION 


A' 


0155 


TYPE A, 'A 2.PEARSON-TRANSFORMATION 


A' 


0156 


TYPE A, 'A 3. SQUARE-ROOT-TRANSFORMATION 

A' 


0157 


TYPE A, 'A 4. POWER-TRANSFORMATION 


A' 


0158 


TYPE A, 'A 5. CHOICE OF DISTRIBUTION 


A' 


0159 


TYPE A, 'A 6. TO MASTER MENU 


A' 


0160 


TYPE A, 'A 


—A' 


0161 


TYPE A, 'Type the serial number of desired option' 


0162 


READ(A,A)IT 




0163 


GO TO (311, 312, 313, 314, 30, 9)IT 




0164 

311 

CALL L06TRAN(N,TX,X) 




0165 


GO TO 310 




0166 

312 

CALL IPT(N,NS,AMEAN,STDEV,SKEW,X) 




0167 


DO 315 1=1, N 




0168 

315 

TX(I)=X(I) 




0169 


GO TO 310 




0170 

313 

CALL SOTRAN(N,X,TX) 




0171 


GO TO 310 





UAX FORTKAN V4. 6-244 
iraSK2;CaK.MISHRA3EORCLIP.F 



FORCLIPIMAIN 


28-NOV-1990 09:33:5G VAX FORTRAN U4. 6-244 

28-NOV-1990 09:33:27 $DISK2:CJK.NISHRA3F0RCLIP .FOE; 


0172 

0173 

0174 

0175 

0176 

0177 

0178 

0179 

0180 
0181 
0182 

0183 

0184 

0185 

0186 

0187 

0188 

0189 

0190 

0191 

0192 

0193 

0194 

0195 

0196 

0197 

0198 

0199 

0200 
0201 
0202 

0203 

0204 

0205 

0206 

0207 

0208 

0209 

0210 
0211 
0212 

0213 

0214 

0215 

0216 

0217 

0218 

0219 

0220 
0221 
0222 

0223 

0224 

0225 

0226 

0227 

0228 


314 CALL TSPT(X,N,TX) 

310 «PITE(A,*)(TX(I),I=1,N) 

DO 316 1=1, N 
316 X(I)=TX(I) 

60 TO 30 

40 TYFE A, 'AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA' 


TYPE A,^A if METHOD OF ANALYSIS if A' 

TYPE A, 'A A' 

TYPE A, 'A 1. METHOD OF MOMENTS A" 

TYPE A, 'A 2. METHOD OF MAXIMUM LIKELYHOOD A' 

TYPE A, 'A 3. METHOD OF LEAST-SQUARES A' 

TYPE A, 'A 4. TO MASTER MENU A' 

TYPE A, 'A A' 

TYPE A, 'Which »ethod do you like?' 


TYPE A, 'Type the serial number of desired option' 

READ(A,A)MA 

60 TO (41,42,43,9)MA 

41 METH0D=1 
GO TO 30 

42 METH0D=2 
GO TO 30 

43 WRITE( A,A) ' It is only available for Normal Distribution' 

WRITE(A,A) 'For other cases pretransform the data into normal 
1 variables' 

URITE(A,A) 'Do you want to use it ? If yes type 1' 

READ(A,A)MLSQ 

IFCMLSQ.EQ. DIHEN 

CALL MLS 

END IF 

60 TO 40 

50 TYPE A, 'AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA' 


TYPE A, 'A if TEST OF GOODNESS OF FIT if AA' 

TYPE A, 'A A' 

TYPE A, 'A 1. CHI-SQUARE TEST A' 

TYPE A, 'A 2.KOLMOQOROV-SMIRNOV TEST A' 

TYPE A, 'A 3. TO PLOTTING RESULTS A' 

TYPE A, 'A 4.C0MPARISI0N OF STANDARD ERROR A' 

TYPE A, 'A OF ESTIMATE FOR DIFFERENT A' 

TYPE A, 'A DISTRIBUTIONS A' 

TYPE A, 'A 5. TO MASTER MENU A' 

TYPE A, 'A 6. HELP FROM EXPERT-SYSTEM INTERFACE A' 

TYPE A, 'A A' 


TYPE A, 'Type the serial number of desired option' 
READ(A,A)ITOG 

60 TO (51,52,53,54,9,100)1106 
51 IF(IT.EQ.0)G0 TO 512 

IF(IT-4)511,511,512 

512 DO 513 1=1, N 

513 IX(I)=X(I) 

51 1 CALL CH ISQRT ( 8 , AME AN , SIDE V , TX , N , X ) 

60 TO 60 

52 CALL K0LM6<X,N,D) 

GO TO 60 

53 TYPE A, 'Are you satisfied? If yes type 1' 

READ(A,A)IS 
IF (IS. EG. 1)60 TO 60 
60 TO 9 ’ 



FOFCLIPiMAIN 


28-NOV-1990 09:33:56 
28-Nov‘-1990 09:33:27 


VAX FORTRAN V4. 6-244 
iriISK2:C3K,HISHRA3FORCLIP.FOR;35 


0229 

0230 

0231 

0232 

0233 

0234 

0235 

0236 

0237 

0238 

0239 

0240 

0241 

0242 

0243 

0244 

0245 

0246 

0247 

0248 

0249 

0250 

0251 

0252 

0253 

0254 

0255 

0256 

0257 

0258 

0259 

0260 
0261 
0262 

0263 

0264 

0265 

0266 

0267 

0268 

0269 

0270 

0271 

0272 

0273 

0274 

0275 

0276 

0277 

0278 

0279 


54 

60 


61 

62 

100 


101 

102 

103 

180 


CALL SER 
60 TO 9 

TYPE A, '**)icA*AAAA*A***AAA*A**A*AAAA*y^A*AA*A****A**AAA**A' 


TYPE A/AA i% PLOTTING RESULTS A' 

TYPE A, 'A A' 

TYPE A, 'A 1. FREQUENCY PLOT A' 

TYPE A,^A 2.C.D.F.PL0T A^ 

TYPE A/ A 3. TO MASTER MENU A" 

TYPE A, 'A A' 


TYPE A, 'Type the serial number of desired option' 

REAIi(A,A)IPR 

GO TO (61,62,9) IPR 

TYPE A, 'NOT INCORPORATED AS YET' 

TYPE A, 'NOT INCORPORATED AS YET' 

IF(AMEAN.EQ.O.O)THEN 

TYPE A, 'Expert system interface is only available' 

TYPE A,' when statistical parameters are known' 

GO TO 20 
END IF 

TYPE A,'AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA' 
TYPE A, 'A %% EXPERT SYSTEM INTERFACE $$ AA' 

TYPE A, 'A A' 

TYPE A, 'A l.FOR PRELIMINARY ADVICE A' 

TYPE A, 'A 2. FOR SECONDARY ADVICE A' 

TYPE A, 'A 3. FOR EXPERT ANALYSIS OF RESULTS A' 

TYPE A, 'A 7. TO MASTER MENU A' 

TYPE A, 'A A' 

TYPE A, 'Which help do you want?' 

TYPE A, 'Type the serial number of desired option' 

READ(A,A)IES 

IFdES.EQ.DGO TO 101 

60 TO <101,102,103,9)IES 

IF(IES.EQ.7)60 TO 9 

CALL PTHELP(AMEAN, SKEW, STDEV, VAR, KURT, N) 

TYPE A, 'Value passed - ',v_value 
TYPE A, 'Name passed is = ',c__c_name 
GO TO 9 

TYPE A, 'NOT INCORPORATED AS YET' 

TYPE A, 'NOT INCORPORATED AS YET' 

60 TO 9 ^ 

TYPE A, 'There is no more seasons.' 


TYPE A, 'A A' 

TYPE A, 'A l.TO MASTER MENU A' 
TYPE A, 'A 2. TO INPUT MODULE A' 
TYPE A, 'A 3. TO QUIT A' 
TYPE A, 'A A' 


TYPE A, 'Type the serial number of desired option' 

READ(A,A)KL 

60 T0(9,11,90)KL 

STOP 

END 


90 



APPENDIX E 

KBES 


(deffacis a 
(start) ) 


(defrule rulel 
(start) 

=> 


(f printout t 
(fprintout t 
(f print out t 
(fprintout t 
(fprintout t 
(fprintout t 
(fprintout t 

(fprintout t 
(fprintout t 
(fprintout t 
(fprintout t 

(assert (type 


• WELCOME 

•crlf) 


•crlf) 

• TO 

•crlf) 


•crlf) 

• F A C H V E S 

•crlf) 


•crlf) 

'This expert system is designed for 

single 

peaked continuous hydrologic variables 

only 'cr If ) 


• 'crlf) 

•IF YOU DO NOT UNDERSXANT THE QUESTION ; Type nk’crlf) 
’ 'crlf) 


•Type of data type the first letter (e/a/s) 

Whether it is exceedence OR annual ?•) 
of data =(re3d)))) 


(defrule rule2 
(length of data ?n) 

(type of data ?t> 

=> 

(if (< ?n 10) 
then 

(fprintout t ’No analysis is possible •) 

(assert(data is too small)) 

(assert (process continue no)) 

(bind ?riui« 1.0) 

(assert (suggestion Tnum)) 

(assert(esad KA)) 

(assert (stop) ) 
else 

(if (< ?n 25) 
then 

(fprintout t 'Results are suspected ;Do you want to continue ? *) 
(assert(process continue =(read))) 

(assertdnsufficxent data)) 

(bind ?nura 2.0) 

(assert (suggestion ?num)) 

(assert(esad RS)) 
else 

(assert (process continue yes)) 

(assert(d3ta is large enough))))) 



<defrule rule3 
(declare(s3liehce 10)) 

(process continue ?pc) 

(test (eq ?pc yes)) 

=> 

(fprintout t 'Do you think that outliers are present? •) 
(assert(outliers are =(read)))) 

(defrule rule4 
(outliers are ?o«) 

(or(data is large enough) 

(insufficient data)) 

(process continue ?pc) 

(test (eq ?pc yes)) 

=> 

(if (eq ? 0 B) yes) 
then 

(assert(outliers yes)) 
else 

(if (eq ?o« nk) 
then 

(f printout t *In absence of information outliers are 
assumed not to be present'crlf ) 
(assert(outliers no))))) 


(defrule ruleS 
(type of data ?t) 

(or(data is large enough) 

(insufficient data)) 

(process continue ?pc) 

(test (eq ?pc yes)) 

(outliers ?o) 

=> 

(if (eq yes ?o) 
then 

(assert (outliers present)) 
else 

(if (eq ?t e) 
then 

(fprintout t 'Negative exponential distribution is well suited'crlf) 
(bind ?nu® 4-0) 

(assert(suggestion ?nu»)) 

(assert(esad NED)) 
else 

(fprintout t “Here is the list of steps 

1. AADO THE milMINARY ANALYSIS 

2. AAFIT A SUITABLE DISTRIBUTION 
(GENERALLY A 2 PARAMETER DISTRIBUTION 

3. AATEST FOR GOODNESS OF FIT 

4. AAC0KPARE THE RESULTS'crlf ) ) ) ) 
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(defrule rule6 
(process continue ?pc> 

(outliers present) 

(type of data ?t) 

(outliers ?o) 

(test (eq ?o yes) ) 

(test (eq ?pc yes)) 

*> 

(if (eq ?t e) 
then 

(fprintout t *You should use uethod of least squares for 
paraneter esti«atiori AND also use negative 
exponential distribution'crlf ) 

(assert(stop) ) ^ 

(bind ?nu» 28-0) 

(assert(su 99 estion ?nu»)) 

(assert(esad MLSNED)) 
else 

(fprintout t 'You should use method of least squares for 
parameter estimation AND also use normal 
distribution ;You should transform the data 
essentially to normal distribution using 
suitable transformations 'crlf) 

(assert(stop) ) 

(bind ?nu» 4.0) 

(assert (suggestion ?nu«)) 

(assert (esad HLSNIO))) 


(defrule rule? 

(process continue ?pc) 

(test (eq ?pc yes)) 

(or(dat3 is large enough) 

(insufficient data)) 

(not(stop) ) 

(kurt ?k) 

(mean Tam) 

(skew ?sk) 

(stdev ?st) 

(testdtdiO ?sk -0.5) 

(< ?sk 0.5)) 

(> 0.4(/ ?st ?am)))) 

(if « ?5k 0) 

then * 

(if (< ?st(/ Tam 3.0)) 
then 

(fprintout t •NORMAL DISTRIBUTION CAN BE APPROXIMATED'crlf ) 
(bind ?nuitt 11.0) 

(assert (suggestion ?num)) 

(assert(esad ND) ) 


else 



(if (> 

?k 

1.0) 

then 



(if (< 

?k 

2.25) 

then 





(fprirftout t ‘UNIFORM IHSIRIBUTION IS WELL SUITEP^crlf) 
(bind ?riuin 5*0) 

(assert (su 39 e£tiori Tnur;)) 

(assert(esad UIi)) 
else 

(if illi> ?k 3.5) 

« n. 10 . 0 )) 

then 

(fjprintout t H- DISTRIBUTION IS WELL SUITED'crlf) 

(bind ?nuK 7.0) 

(ass€rt(su 99 estiori ?nuirt)) 

(assert(esad tlD) 
else 

(if (t%« ?k 3.5) 

(> ?k 2.25)) 

then 

(fprintout t "NORMAL DISTRIBUTION IS WELL SUITEH'crlf ) 
(bind ?nuit! 6.0) 

(assert(5U39estion ?riuiii)) 

(assert(esad ND))))))))) 


(defrule ruleS 
(or (data is larse eriouch) 
(insufficient data)) 
(process continue ?pc) 
(test (eq T'pc yes)) 

(kurt ?k) 

(wear. ?3i«) 

(skew ?sk) 

(teste j \ «> ?5k -0.5) 

(>= ?sk 0.5))) 

(if «= ?sk -0.5) 
then 

(assert (negative 
else 

(assert(positive skew)))) 


(defrule ruleS 

(or (data is larsc enoush) 

(insufficient data)) 

(process continue ?pc) 

(test (eq ?pc yes)) 

(positive skew) 

(kurt ?k) 

(skew ?5k) 

(test(< ?k (+ (A 1.25 ?sk) 1))) 

(fprintout t ‘No distribution will be suited'crlf) 

(fprintout t ■ Check for data errors or wultiple distribution'crlf ) 
(bind ?nuiK 20.0) 

(assert (su 93 estion ?nu»)) 

(assert(esad NoD))) 



(defrule rulelO 
(positive skew) 

(process continue ?pc) 

(or (data is larse enoush) 

(insufficient data)) 

(test (e<i ?pc yes)) 

(Kurt ?k) 

(skew ?sk) 

(test<Xt(< ?k (+ (A 2.25 ?sk) 3)) 

<> ?k (+ (A 1.25 ?sk) 3)))) 

=> 

(fprintout t 'Losnoraal 2 parameter distribution is well suited'crlf) 
(bind ?nuir) 21.0) 

(assert (susgestion ?num)) 

(assert(esad LN2))) 


(defrule rulell 
(positive skew) 

(not(5top) ) 

=> 

(calKparameter) ) 

(fpr.intout t ■From the screen type standard deviation of 
logtransforaed data ?*) 

(assert(stdev of logtran data s(read)))) 


(defrule rulel2 
(positive skew) 

(stdev of logtran data ?ltst) 

=> 

(bind ?lcv (sqrt (** 2.278 (- (*A ?ltst 2.0) 1.0)))) 
(bind ?lest (A 3.0 ?lcv)) 

(bind ?lfst (AA ?lcv 3.0)) 

(bind ?lsk (+ ?lest ?lfst)) 

(assertdogtran skew is ?lsk))) 


(defrule rulel3 
(positive skew) 

(logtran skew is Tlsk) 

(coeff of var ?lcv) 

(test(li(U(> ?lsk -0.5) 

« ?lsk 0.5)) 

(> 0.4 ?lcv))) 

s> 

(3Ssert(lo3nor«al2) )‘ 

(fprintout t • Following transformations can be done 

1. Log transformation 

2. N_root transformation 

OR; You can use 2 parameter lognormal distr ibution*crl: 
(bind ?num 13.0) 

(assert(suggestion ?nuin)) 

(assert(esad LN2) ) ) 



loo 


tcpfrule rulel? 

(fDsitive skew) 

(iuTi Ik) 

(r oKstop) ) 

(lensth of data ?n) 

(testCJIO ?k, (+ 2.2 (/ 6.0 (sqrt ?n)>)) 

(< ?k (+ 3.8 (/ 6.0 (sqrt ?n)>))>> 

(fprintout t '6a»i»a distribution is well suited’crlf) 

<f printout t *Method of ttaxiRum liklihood is recoRwended'crlf ) 
(bind ?nui 10.0) 

<assert(su99estion ?nu»)) 

<assert(es3d 6D))) 


(defrule rulelS 
(nesative skew) 

(skew ?sk) 

(lean ?am) 

(fprintout t ‘YOU CAN DO FOLLOWING TRANSFORMATIONS 
l.INVERSE-PEARSON TRANSFORMATION 
2.n-R00T TRANSFORMATION 
0R;Y0u can fit either pearson type 3 
0R;Y0u can fit logpearson type 3 
OR; You can't fit lognormal 3 distribution'crlf ) 

(bind ?num 12.0) 

(assertCsuggestion ?num)) 

(assertCesad PT3,LP3,LN3, IPTRAN) ) ) 

(defrule rulel9 
(declare (salience -100)) 

(type of data ?t) 

(process continue ?pc) 

(test (eq ?pc yes)) 

(:f (eq ?t s) 
then 

(fprintout t *l.You can fit one distribution for all sea5on5;0R 

2. You can fit different distributions for different 
seasonspOR 

3. You can fit all available distributions for all seasons'crlf > 

(assert (stop) ) 
else 

(if (eq ?t a) 
then 

(fprintout t 'l.You can fit either one OR all available distributions'crlf ) 
(assert(stop) ) ) ) ) 


(defrule rule20 
(oeclare (salience -1000)) 

(stop) 

=>• 

(fprintout t •PROCESS IS BEING STOPPED; 

RESET THE PROCESS IF YOU WANT TO CONTINUE 
WITH OTHER SET OF DATA'crlf) 

(fprintout t 'Do you want any help ? ') 

(assertthelp -(read)))) 
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(defrule rule21 
(declare (salience “S90)) 

{su93CstiGri ?rju») 

(esad ?string> 

(advice ?riu» ?stran 3 ) 

(assert(stop> ) ) 

(defrule rule22 
(stop) 

(help ?hl) 

(if (eq no '?hl) 
then 

(fprintout t 'SugsestionE passed should be taken in sequencial order'crlf) 
•(halt))) 

(defrule rule2-4 
(help ?hl) 

=> 

(if (eq yes ?hl) 
then 

(fprintout t *1 can help you in following ways — AAA 

1. Different types of distributions 

2. Different types of transforaations 

3. Different types of goodness of fit tests 

4. References 

6. Reasoning for any concluding renarks'crlf ) 

(fprintout t ‘HAY I HELP YOU^ •) 

<a£sert(help required =(i‘ead))))) 

(defrule rule25 
(help ?hl} 

(help required ?hp) 

=> 

(if (eq ?hp yes) 
then 

(fprintout t ‘Type the serial nunber ’crlf ) 

(fprintout t 'Which help do you need? *) 

(assertChelp needed =(read))) 
else 

(fprintout t 'Then why did you disturb «e????'crlf ) ) ) 

(defrule rule26 
(help needed ?hn) 

(help ?hl) 

r> 

(if (= ?hn 1) 
then 

(fprintout t 'The following types of distributions are noraally 

used in hydrology AA AAA 

AA. NORMAL DISTRIBUTION 

AA. LOG-NORMAL 2 DISTRIBUTION 

AA. LOG-NORMAL 3 PARAMETER DISTRIBUTION 

AA. PEARSON TYPE 3 DISTRIBUTION 

AA.LOG-PEARSON TYPE 3 DISTRIBUTION 

AA.TYPE 1 EXTREMAL 'DISTRIBUTION 

AA.TYPE 3 EXTREMAL DISTRIBUTION'crlf ) 


else 
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(if (« ?hn 2) 
then 

(fprintout t ’The followins transforaations are senerally 

use<i in HTfDPOLOGy A^AA 

^*.106 

ROOT 

AA.INVERSE-PEARSON 
AA.TWO STEP POUER'crlf) 


else 

(if (* ?hrf 3) 
then 

(fprintout t •Two tests can presenty be done 
He are extremely sorry for these limitations 
Anyway, they are sufficient for hydrolosic data analysis 
A— A. CHI-SQUARE TEST(a parametric test) 

A— A.KOLHOGROV-SHIRNOV XESKa non-par ametric test)*crlf) 

else 

(fprintout t *1 can't help much. See the following references 

1. Applied Hydrology BY V.T.Chow I Haidment 

2. Handbook of Hydrology BY V.T Chow 

3. Frequency and Risk Analysis BY G.U.Kite 
4 . Statistical Hethods In Hydrology by Haan 

5. Hydrology by Raudkivi 

6. Probability and Statistics in Hydrology bY Y.Yevjevich 

7. Hydrologic Frequency Modeling, edited by V.P.Singh'crlf ) ) ) ) ) 



